(57) Abstract

The present invention relates to new genes encoding for the production of novel proteins involved in generation of reactive oxygen
intermediates that affect cell division. The present invention also provides vectors containing these genes, cells transfected with these
vectors, antibodies raised against these novel proteins, kits for detection, localization and measurement of these genes and proteins, and
methods to determine the activity of drugs to affect the activity of the proteins of the present invention.

NOVEL MITOGENIC REGULATORS

The U.S. Government has a paid-up license in this
invention and the right in limited circumstances to require the
patent owner to license others on reasonable terms as provided
for by the terms of National Institutes of Health grants HL38206
and HL58000.

TECHNICAL FIELD 

The present invention relates to the field of normal
and abnormal cell growth, in particular mitogenic regulation.
The present invention provides the following: nucleotide
sequences encoding for the production of enzymes that are
mitogenic regulators; amino acid sequences of these enzymes;
vectors containing these nucleotide sequences; methods for
transfecting cells with vectors that produce these enzymes;
transfected cells; methods for administering these transfected
cells to animals to induce tumor formation; and antibodies to
these enzymes that are useful for detecting and measuring levels
of these enzymes, and for binding to cells possessing
extracellular epitopes of these enzymes.

BACKGROUND OF THE INVENTION 

Reactive oxygen intermediates (ROI) are partial
reduction products of oxygen: 1 electron reduces O2 to form
superoxide (O2-), and 2 electrons reduce O2 to form hydrogen
peroxide (H2O2). ROI are generated as a byproduct of aerobic
metabolism and by toxicological mechanisms. There is growing
evidence for regulated enzymatic generation of O2- and its
conversion to H2O2 in a variety of cells. The conversion of O2-
to H2O2 occurs spontaneously, but is markedly accelerated by
superoxide dismutase (SOD). High levels of ROI are associated
with damage to biomolecules such as DNA, biomembranes and
proteins. Recent evidence indicates generation of ROI under
normal cellular conditions and points to signaling roles for O2-
and H2O2.
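
For reference, the one- and two-electron reductions of oxygen and the dismutation of superoxide described above can be written as balanced reactions. This is a standard textbook formulation, not notation taken from the specification; the proton balance shown assumes aqueous conditions.

```latex
\begin{align*}
\mathrm{O_2} + e^- &\rightarrow \mathrm{O_2^{\bullet-}}
  && \text{(superoxide)} \\
\mathrm{O_2} + 2e^- + 2\,\mathrm{H^+} &\rightarrow \mathrm{H_2O_2}
  && \text{(hydrogen peroxide)} \\
2\,\mathrm{O_2^{\bullet-}} + 2\,\mathrm{H^+} &\xrightarrow{\ \mathrm{SOD}\ } \mathrm{H_2O_2} + \mathrm{O_2}
  && \text{(dismutation)}
\end{align*}
```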

Several biological systems generate reactive
oxygen. Phagocytic cells such as neutrophils generate large
quantities of ROI as part of their battery of bactericidal
mechanisms. Exposure of neutrophils to bacteria or to various
soluble mediators such as formyl-Met-Leu-Phe or phorbol
esters activates a massive consumption of oxygen, termed the
respiratory burst, to initially generate superoxide, with
secondary generation of H2O2, HOCl and hydroxyl radical. The
enzyme responsible for this oxygen consumption is the
respiratory burst oxidase (nicotinamide adenine dinucleotide
phosphate-reduced form (NADPH) oxidase).

There is growing evidence for the generation of
ROI by non-phagocytic cells, particularly in situations related to
cell proliferation. Significant generation of H2O2, O2-, or both
has been noted in some cell types. Fibroblasts and human
endothelial cells show increased release of superoxide in
response to cytokines such as interleukin-1 or tumor necrosis
factor (TNF) (Meier et al. (1989) Biochem. J. 263, 539-545;
Matsubara et al. (1986) J. Immun. 137, 3295-3298).
Ras-transformed fibroblasts show increased superoxide release
compared with control fibroblasts (Irani et al. (1997) Science
275, 1649-1652). Rat vascular smooth muscle cells show
increased H2O2 release in response to PDGF (Sundaresan et al.
(1995) Science 270, 296-299) and angiotensin II (Griendling et
al. (1994) Circ. Res. 74, 1141-1148; Fukui et al. (1997) Circ.
Res. 80, 45-51; Ushio-Fukai et al. (1996) J. Biol. Chem. 271,
23317-23321), and H2O2 in these cells is associated with
increased proliferation rate. The occurrence of ROI in a
variety of cell types is summarized in Table 1 (adapted from
Burdon, R. (1995) Free Radical Biol. Med. 18, 775-794).

Table 1

Superoxide                        Hydrogen Peroxide

human fibroblasts                 Balb/3T3 cells
human endothelial cells           rat pancreatic islet cells
human/rat smooth muscle cells     murine keratinocytes
human fat cells                   rabbit chondrocytes
human osteocytes                  human tumor cells
BHK-21 cells                      fat cells, 3T3 L1 cells
human colonic epithelial cells



ROI generated by the neutrophil have a cytotoxic 
function. While ROI are normally directed at the invading 
microbe, ROI can also induce tissue damage (e.g., in 
inflammatory conditions such as arthritis, shock, lung disease, 
and inflammatory bowel disease) or may be involved in tumor 
initiation or promotion, due to damaging effects on DNA. 
Nathan (Szatrowski et al. (1991) Canc. Res. 51, 794-798)
proposed that the generation of ROI in tumor cells may 
contribute to the hypermutability seen in tumors, and may 
therefore contribute to tumor heterogeneity, invasion and 
metastasis. 

In addition to cytotoxic and mutagenic roles, ROI 
have ideal properties as signal molecules: 1) they are generated 
in a controlled manner in response to upstream signals; 2) the 
signal can be terminated by rapid metabolism of O2- and H2O2 by
SOD and catalase/peroxidases; 3) they elicit downstream effects 
on target molecules, e.g., redox-sensitive regulatory proteins 
such as NF kappa B and AP-1 (Schreck et al. (1991) EMBO J. 
10, 2247-2258; Schmidt et al. (1995) Chemistry & Biology 2, 
13-22). Oxidants such as O2- and H2O2 have a relatively well
defined signaling role in bacteria, operating via the SoxI/II 
regulon to regulate transcription. 

ROI appear to have a direct role in regulating cell 
division, and may function as mitogenic signals in pathological 
conditions related to growth. These conditions include cancer 
and cardiovascular disease. O2- is generated in endothelial cells
in response to cytokines, and might play a role in angiogenesis
(Matsubara et al. (1986) J. Immun. 137, 3295-3298). O2- and
H2O2 are also proposed to function as "life-signals", preventing
cells from undergoing apoptosis (Matsubara et al. (1986) J.
Immun. 137, 3295-3298). As discussed above, many cells
respond to growth factors (e.g., platelet derived growth factor
(PDGF), epidermal derived growth factor (EGF), angiotensin
II, and various cytokines) with both increased production of
O2-/H2O2 and increased proliferation. Inhibition of ROI generation
prevents the mitogenic response. Exposure to exogenously
generated O2- and H2O2 results in an increase in cell
proliferation. A partial list of responsive cell types is shown
below in Table 2 (adapted from Burdon, R. (1995) Free
Radical Biol. Med. 18, 775-794).

Table 2

Superoxide                          Hydrogen peroxide

human, hamster fibroblasts          mouse osteoblastic cells
Balb/3T3 cells                      Balb/3T3 cells
human histiocytic leukemia          rat, hamster fibroblasts
mouse epidermal cells               human smooth muscle cells
rat colonic epithelial cells        rat vascular smooth muscle cells
rat vascular smooth muscle cells

While non-transformed cells can respond to growth
factors and cytokines with the production of ROI, tumor cells
appear to produce ROI in an uncontrolled manner. A series of
human tumor cells produced large amounts of hydrogen
peroxide compared with non-tumor cells (Szatrowski et al.
(1991) Canc. Res. 51, 794-798). Ras-transformed NIH 3T3
cells generated elevated amounts of superoxide, and inhibition
of superoxide generation by several mechanisms resulted in a
reversion to a "normal" growth phenotype.

O2- has been implicated in maintenance of the
transformed phenotype in cancer cells including melanoma,
breast carcinoma, fibrosarcoma, and virally transformed tumor
cells. Decreased levels of the manganese form of SOD
(MnSOD) have been measured in cancer cells and in
vitro-transformed cell lines, predicting increased O2- levels (Burdon,
R. (1995) Free Radical Biol. Med. 18, 775-794). MnSOD is
encoded on chromosome 6q25 which is very often lost in
melanoma. Overexpression of MnSOD in melanoma and other
cancer cells (Church et al. (1993) Proc. Natl. Acad. Sci. 90,
3113-3117; Fernandez-Pol et al. (1982) Canc. Res. 42, 609-617;
Yan et al. (1996) Canc. Res. 56, 2864-2871) resulted in
suppression of the transformed phenotype.

ROI are implicated in growth of vascular smooth
muscle associated with hypertension, atherosclerosis, and
restenosis after angioplasty. O2- generation is seen in rabbit
aortic adventitia (Pagano et al. (1997) Proc. Natl. Acad. Sci.
94, 14483-14488). Vascular endothelial cells release O2- in
response to cytokines (Matsubara et al. (1986) J. Immun. 137,
3295-3298). O2- is generated by aortic smooth muscle cells in
culture, and increased O2- generation is stimulated by
angiotensin II which also induces cell hypertrophy. In a rat
model system, infusion of angiotensin II leads to hypertension as
well as increased O2- generation in subsequently isolated aortic
tissue (Ushio-Fukai et al. (1996) J. Biol. Chem. 271,
23317-23321; Yu et al. (1997) J. Biol. Chem. 272, 27288-27294).
Intravenous infusion of a form of SOD that localizes to the
vasculature or an infusion of an O2- scavenger prevented
angiotensin II induced hypertension and inhibited ROI
generation (Fukui et al. (1997) Circ. Res. 80, 45-51).

The neutrophil NADPH oxidase, also known as
phagocyte respiratory burst oxidase, provides a paradigm for
the study of the specialized enzymatic ROI-generating system.
This extensively studied enzyme oxidizes NADPH and reduces
oxygen to form O2-. NADPH oxidase consists of multiple
proteins and is regulated by assembly of cytosolic and
membrane components. The catalytic moiety consists of
flavocytochrome b558, an integral plasma membrane enzyme
comprised of two components: gp91phox (gp refers to
glycoprotein; phox is an abbreviation of the words phagocyte
and oxidase) and p22phox (p refers to protein). gp91phox
contains 1 flavin adenine dinucleotide (FAD) and 2 hemes as
well as the NADPH binding site. p22phox has a C-terminal
proline-rich sequence which serves as a binding site for
cytosolic regulatory proteins. The two cytochrome subunits,
gp91phox and p22phox, appear to stabilize one another, since the
genetic absence of either subunit, as in the inherited disorder
chronic granulomatous disease (CGD), results in the absence of
the partner subunit (Yu et al. (1997) J. Biol. Chem. 272,
27288-27294). Essential cytosolic proteins include p47phox,
p67phox and the small GTPase Rac, of which there are two
isoforms. p47phox and p67phox both contain SH3 regions and
proline-rich regions which participate in protein interactions
governing assembly of the oxidase components during
activation. The neutrophil enzyme is regulated in response to
bacterial phagocytosis or chemotactic signals by
phosphorylation of p47phox, and perhaps other components, as
well as by guanine nucleotide exchange to activate the
GTP-binding protein Rac.
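
The net reaction catalyzed by the phagocyte oxidase, as described above (oxidation of NADPH with one-electron reduction of oxygen), is commonly written as follows. The stoichiometry is the generally accepted textbook form and is given here only as an illustration; it is not a formula stated in this specification.

```latex
\[
\mathrm{NADPH} + 2\,\mathrm{O_2} \;\rightarrow\; \mathrm{NADP^+} + \mathrm{H^+} + 2\,\mathrm{O_2^{\bullet-}}
\]
```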

The origin of ROI in non-phagocytic tissues is
unproven, but the occurrence of phagocyte oxidase components
has been evaluated in several systems by immunochemical
methods, Northern blots and reverse transcriptase-polymerase
chain reaction (RT-PCR). The message for p22phox is
expressed widely, as is that for Rac1. Several cell types that are
capable of O2- generation have been demonstrated to contain all
of the phox components including gp91phox, as summarized
below in Table 3. These cell types include endothelial cells,
aortic adventitia and lymphocytes.

Table 3

Tissue                        gp91phox   p22phox   p47phox   p67phox

neutrophil                    +(1,2)     +(1,2)    +(1,2)    +(1,2)
aortic adventitia             +          +         +         +
lymphocytes                              +         +(1,2)    +(1,2)
endothelial cells                        +         +(1,2)    +(1,2)
glomerular mesangial cells               +(1,2)    +(1,2)    +(1,2)
fibroblasts                              +         +(1,2)    +
aortic sm. muscle                        +(1,2)    ?         ?

1 = protein expression shown. 2 = mRNA expression shown.

However, a distinctly different pattern is seen in
several other cell types shown in Table 3 including glomerular
mesangial cells, rat aortic smooth muscle and fibroblasts. In
these cells, expression of gp91phox is absent while p22phox and
in some cases cytosolic phox components have been
demonstrated to be present. Since gp91phox and p22phox
stabilize one another in the neutrophil, there has been much
speculation that some molecule, possibly related to gp91phox,
accounts for ROI generation in glomerular mesangial cells, rat
aortic smooth muscle and fibroblasts (Ushio-Fukai et al. (1996)
J. Biol. Chem. 271, 23317-23321). Investigation of fibroblasts
from a patient with a genetic absence of gp91phox provides
proof that the gp91phox subunit is not involved in ROI
generation in these cells (Emmendorffer et al. (1993) Eur. J.
Haematol. 51, 223-227). Depletion of p22phox from vascular
smooth muscle using an antisense approach indicated that this
subunit participates in ROI generation in these cells, despite the
absence of detectable gp91phox (Ushio-Fukai et al. (1996) J.
Biol. Chem. 271, 23317-23321). At this time the molecular
candidates possibly related to gp91phox and involved in ROI
generation in these cells are unknown.

Accordingly, what is needed is the identity of the
proteins involved in ROI generation, especially in
non-phagocytic tissues and cells. What is also needed are the
nucleotide sequences encoding for these proteins, and the
primary sequences of the proteins themselves. Also needed are
vectors designed to include nucleotides encoding for these
proteins. Probes and PCR primers derived from the nucleotide
sequence are needed to detect, localize and measure nucleotide
sequences, including mRNA, involved in the synthesis of these
proteins. In addition, what is needed is a means to transfect
cells with these vectors. What is also needed are expression
systems for production of these molecules. Also needed are
antibodies directed against these molecules for a variety of uses
including localization, detection, measurement and passive
immunization.



SUMMARY OF THE INVENTION 
The present invention solves the problems
described above by providing a novel family of nucleotide
sequences and proteins encoded by these nucleotide sequences
termed mox proteins and duox proteins. In particular the
present invention provides compositions comprising the
nucleotide sequences SEQ ID NO:1, SEQ ID NO:3, SEQ ID
NO:22, SEQ ID NO:41, SEQ ID NO:45, SEQ ID NO:47, and
fragments thereof, which encode for the expression of proteins
comprising SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:21, SEQ
ID NO:42, SEQ ID NO:46, SEQ ID NO:48, respectively, and
fragments thereof. While not wanting to be bound by the
following statement, it is believed that these proteins are
involved in ROI production. The present invention also
provides vectors containing these nucleotide sequences, cells
transfected with these vectors which produce the proteins
comprising SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:21, SEQ
ID NO:42, SEQ ID NO:46, SEQ ID NO:48, and fragments
thereof, and antibodies to these proteins and fragments thereof.
The present invention also provides methods for stimulating
cellular proliferation by administering vectors encoded for
production of the proteins comprising SEQ ID NO:2, SEQ ID
NO:4, SEQ ID NO:21, SEQ ID NO:42, SEQ ID NO:46, SEQ
ID NO:48 and fragments thereof. The present invention also
provides methods for stimulating cellular proliferation by
administering the proteins comprising SEQ ID NO:2, SEQ ID
NO:4, SEQ ID NO:21, SEQ ID NO:42, SEQ ID NO:46, SEQ
ID NO:48 and fragments thereof. The nucleotides and
antibodies of the present invention are useful for the detection,
localization and measurement of the nucleic acids encoding for
the production of the proteins of the present invention, and also
for the detection, localization and measurement of the proteins
of the present invention. These nucleotides and antibodies can
be combined with other reagents in kits for the purposes of
detection, localization and measurement.

Most particularly, the present invention involves a 
method for regulation of cell division or cell proliferation by 
modifying the activity or expression of the proteins described as 
SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:21, SEQ ID NO:42, 
SEQ ID NO:46, SEQ ID NO:48 or fragments thereof. These 
proteins, in their naturally occurring or expressed forms, are 
expected to be useful in drug development, for example for 
screening of chemical and drug libraries by observing inhibition
of the activity of these enzymes. Such chemicals and drugs 
would likely be useful as treatments for cancer, prostatic 
hypertrophy, benign prostatic hypertrophy, hypertension, 
atherosclerosis and many other disorders involving abnormal 
cell growth or proliferation as described below. The entire 
expressed protein may be useful in these assays. Portions of the 
molecule which may be targets for inhibition or modification 
include but are not limited to the binding site for pyridine 
nucleotides (NADPH or NADH), the flavoprotein domain 
(approximately the C-terminal 265 amino acids), and/or the 
binding or catalytic site for flavin adenine dinucleotide (FAD). 

The method of the present invention may be used
for the development of drugs or other therapies for the
treatment of conditions associated with abnormal growth
including, but not limited to the following: cancer, psoriasis,
prostatic hypertrophy, benign prostatic hypertrophy,
cardiovascular disease, proliferation of vessels, including but
not limited to blood vessels and lymphatic vessels, arteriovenous
malformation, vascular problems associated with the eye,
atherosclerosis, hypertension, and restenosis following
angioplasty. The enzymes of the present invention are excellent
targets for the development of drugs and other agents which
may modulate the activity of these enzymes. It is to be
understood that modulation of activity may result in enhanced,
diminished or absence of enzymatic activity. Modulation of the
activity of these enzymes may be useful in treatment of
conditions associated with abnormal growth.

WO 00/28031    PCT/US99/26592

Drugs which affect the activity of the enzymes
represented in SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:21,
SEQ ID NO:42, SEQ ID NO:46, SEQ ID NO:48, or fragments
thereof, may also be combined with other therapeutics in the
treatment of specific conditions. For example, these drugs may
be combined with angiogenesis inhibitors in the treatment of
cancer, with antihypertensives for the treatment of
hypertension, and with cholesterol lowering drugs for the
treatment of atherosclerosis.

Accordingly, an object of the present invention is to 
provide nucleotide sequences, or fragments thereof, encoding 
for the production of proteins, or fragments thereof, that are 
involved in ROI production. 

Another object of the present invention is to 
provide vectors containing these nucleotide sequences, or 
fragments thereof. 

Yet another object of the present invention is to 
provide cells transfected with these vectors. 

Still another object of the present invention is to 
administer cells transfected with these vectors to animals and 
humans. 

Another object of the present invention is to 
provide proteins, or fragments thereof, that are involved in ROI 
production. 

Still another object of the present invention is to
provide antibodies, including monoclonal and polyclonal
antibodies, or fragments thereof, raised against proteins, or
fragments thereof, that are involved in ROI production.

Another object of the present invention is to
administer genes containing nucleotide sequences, or fragments
thereof, encoding for the production of proteins, or fragments
thereof, that are involved in ROI production, to animals and
humans and also to cells obtained from animals and humans.

Another object of the present invention is to
administer antisense complementary sequences of genes
containing nucleotide sequences, or fragments thereof, encoding
for the production of proteins, or fragments thereof, that are
involved in ROI production, to animals and humans and also to
cells obtained from animals and humans.

Yet another object of the present invention is to
provide a method for stimulating or inhibiting cellular
proliferation by administering vectors containing nucleotide
sequences, or fragments thereof, encoding for the production of
proteins, or fragments thereof, that are involved in ROI
production, to animals and humans. It is also an object of the
present invention to provide a method for stimulating or
inhibiting cellular proliferation by administering vectors
containing antisense complementary sequences of nucleotide
sequences, or fragments thereof, encoding for the production of
proteins, or fragments thereof, that are involved in ROI
production, to animals and humans. These methods of
stimulating cellular proliferation are useful for a variety of
purposes, including but not limited to, developing animal
models of tumor formation, stimulating cellular proliferation of
blood marrow cells following chemotherapy or radiation, or in
cases of anemia.

Still another object of the present invention is to
provide antibodies useful in immunotherapy against cancers
expressing the proteins represented in SEQ ID NO:2, SEQ ID
NO:4, SEQ ID NO:21, SEQ ID NO:42, SEQ ID NO:46, SEQ
ID NO:48 or fragments thereof.

Yet another object of the present invention is to
provide nucleotide probes useful for the detection, localization
and measurement of nucleotide sequences, or fragments thereof,
encoding for the production of proteins, or fragments thereof,
that are involved in ROI production.

Another object of the present invention is to
provide antibodies useful for the detection, localization and
measurement of nucleotide sequences, or fragments thereof,
encoding for the production of proteins, or fragments thereof,
that are involved in ROI production.

Another object of the present invention is to
provide kits useful for detection of nucleic acids including the
nucleic acids represented in SEQ ID NO:1, SEQ ID NO:3, SEQ
ID NO:22, SEQ ID NO:41, SEQ ID NO:45, SEQ ID NO:47, or
fragments thereof, that encode for proteins, or fragments
thereof, that are involved in ROI production.

Yet another object of the present invention is to
provide kits useful for detection and measurement of nucleic
acids including the nucleic acids represented in SEQ ID NO:1,
SEQ ID NO:3, SEQ ID NO:22, SEQ ID NO:41, SEQ ID
NO:45, SEQ ID NO:47, or fragments thereof, that encode for
proteins, or fragments thereof, that are involved in ROI
production.

Still another object of the present invention is to
provide kits useful for the localization of nucleic acids including
the nucleic acids represented in SEQ ID NO:1, SEQ ID NO:3,
SEQ ID NO:22, SEQ ID NO:41, SEQ ID NO:45, SEQ ID
NO:47, or fragments thereof, that encode for proteins, or
fragments thereof, that are involved in ROI production.

Another object of the present invention is to
provide kits useful for detection of proteins, including the
proteins represented in SEQ ID NO:2, SEQ ID NO:4, SEQ ID
NO:21, SEQ ID NO:42, SEQ ID NO:46, SEQ ID NO:48, or
fragments thereof, that are involved in ROI production.

Yet another object of the present invention is to
provide kits useful for detection and measurement of proteins,
including the proteins represented in SEQ ID NO:2, SEQ ID
NO:4, SEQ ID NO:21, SEQ ID NO:42, SEQ ID NO:46, SEQ
ID NO:48, or fragments thereof, that are involved in ROI
production.

Still another object of the present invention is to 
provide kits useful for localization of proteins, including the 
proteins represented in SEQ ID NO:2, SEQ ID NO:4, SEQ ID 
NO:21, SEQ ID NO:42, SEQ ID NO:46, SEQ ID NO:48, or 
fragments thereof, that are involved in ROI production. 

Yet another object of the present invention is to
provide kits useful for the detection, measurement or
localization of nucleic acids, or fragments thereof, encoding for
proteins, or fragments thereof, that are involved in ROI
production, for use in diagnosis and prognosis of abnormal
cellular proliferation related to ROI production.

Another object of the present invention is to
provide kits useful for the detection, measurement or
localization of proteins, or fragments thereof, that are involved
in ROI production, for use in diagnosis and prognosis of
abnormal cellular proliferation related to ROI production.

These and other objects, features and advantages of 
the present invention will become apparent after a review of the 
following detailed description of the disclosed embodiments and 
the appended drawings. 

BRIEF DESCRIPTION OF THE FIGURES

Fig. 1(a-d). Comparison of amino acid sequences
of the human mox1 protein (labeled mox1.human, SEQ ID
NO:2), rat mox1 protein (labeled mox1.rat, SEQ ID NO:21),
human mox2 protein (labeled mox2.human, SEQ ID NO:4) of
the present invention to human (gp91phox/human.pep, SEQ ID
NO:12), bovine (gp91phox/bovine.pep, SEQ ID NO:37), and
murine (gp91phox/mouse.pep, SEQ ID NO:38) proteins. Also
included are related plant enzyme proteins
cytb558.arabidopsis.pep (SEQ ID NO:39) and cytb558.rice.pep
(SEQ ID NO:40). Enclosed in boxes are similar amino acid
residues.

Fig. 2. Sequence similarities among proteins
related to gp91phox including human mox1 (SEQ ID NO:2),
human mox2 (SEQ ID NO:4), and rat mox1 (SEQ ID NO:21).
The dendrogram indicates the degree of similarity among this
family of proteins, and also includes the related plant enzymes.

Fig. 3. Cell free assay for mox-1 activity.
Superoxide generation was measured using the
chemiluminescent reaction between lucigenin and superoxide in
cell lysates from vector control NEF2 and mox1 transfected
NIH3T3 cells.

Fig. 4. Superoxide generation by human mox1.
Reduction of NBT in mox1 transfected and control fibroblasts
was measured in the absence (filled bars) or presence (open
bars) of superoxide dismutase.

Fig. 5. Aconitase (filled bars), lactate
dehydrogenase (narrow hatching) and fumarase (broad
hatching) were determined in lysates of cells transfected with
vector alone (NEF2) or with mox1 (YA26, YA28 and YA212).

DETAILED DESCRIPTION OF THE INVENTION 

The present invention solves the problems
described above by providing a novel family of nucleotide
sequences and proteins, encoded by these nucleotide sequences,
termed mox proteins and duox proteins. The term "mox"
refers to "mitogenic oxidase" while the term "duox" refers to
"dual oxidase". In particular, the present invention provides
novel compositions comprising the nucleotide sequences SEQ ID
NO:1, SEQ ID NO:3, SEQ ID NO:22, SEQ ID NO:41, SEQ ID
NO:45, SEQ ID NO:47, and fragments thereof, which encode,
respectively, for the expression of proteins comprising SEQ ID
NO:2, SEQ ID NO:4, SEQ ID NO:21, SEQ ID NO:42, SEQ ID
NO:46, SEQ ID NO:48 and fragments thereof.

Both the mox and duox proteins described herein
have homology to the gp91phox protein involved in ROI
generation; however, the mox and duox proteins comprise a
novel and distinct family of proteins. The mox proteins
included in the present invention have a molecular weight of
approximately 65 kDa as determined by reducing gel
electrophoresis and are capable of inducing ROI generation in
cells. As described in more detail below, the mox proteins of
the present invention also function in the regulation of cell
growth, and are therefore implicated in diseases involving
abnormal cell growth such as cancer. The present invention
describes mox proteins found in human and rat; however, it is
likely that the mox family of genes/proteins is widely
distributed among multicellular organisms.

The duox proteins described herein are larger than
the mox proteins and have three distinct regions: the amino
terminal region having homology to peroxidase proteins, the
internal region having homology to calmodulin (CAM) proteins
and the carboxy-terminal region having homology to mox
proteins. Human duox1 is shown in SEQ ID NO:46 and a
portion of human duox2 is shown in SEQ ID NO:48.
Nucleotides encoding duox1 and duox2 proteins are also shown
in SEQ ID NO:45 and SEQ ID NO:47, respectively. In
addition to the human duox proteins, comparison of the
sequence of human duox1 and human duox2 with genomic
databases using BLAST searching resulted in the identification
of two homologs of duox in C. elegans (Ce-duox1 and
Ce-duox2). Drosophila also appears to have at least one duox
homolog. Thus, the duox family of genes/proteins is widely
distributed among multicellular organisms.
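
As an illustration of how similarity between a query sequence and a candidate homolog can be quantified once the two have been aligned, the sketch below computes percent identity over a pairwise alignment. It is a minimal example under stated assumptions: it is not the BLAST algorithm, the function name `percent_identity` is hypothetical, and the two short sequences are invented toy fragments that are not taken from any SEQ ID NO of this specification.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity between two pre-aligned sequences of equal length.

    Gaps are written as '-'. Columns in which both sequences carry a gap
    are skipped; a gap opposite a residue counts as a mismatch.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for res_a, res_b in zip(aligned_a, aligned_b):
        if res_a == "-" and res_b == "-":
            continue  # column carries no information
        compared += 1
        if res_a == res_b:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable alignment columns")
    return 100.0 * matches / compared


# Hypothetical toy fragments (assumption: illustrative only).
query = "MKT-LLVAG"
subject = "MKTALLIAG"
print(round(percent_identity(query, subject), 1))
```

Database search tools such as BLAST additionally score substitutions and gaps and report statistical significance, which this sketch does not attempt.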

Although not wanting to be bound by the following
statement, it is believed that duox1 and duox2 have dual
enzymatic functions, catalyzing both the generation of
superoxide and peroxidative type reactions. The latter class of
reactions utilizes hydrogen peroxide as a substrate (and in some
cases has been proposed to utilize superoxide as a substrate).
Since hydrogen peroxide is generated spontaneously from the
dismutation of superoxide, it is believed that the NAD(P)H
oxidase domain generates the superoxide and/or hydrogen
peroxide which can then be used as a substrate for the
peroxidase domain. In support of this hypothesis, a model for
the duox1 protein in C. elegans has been developed that has an
extracellular N-terminal peroxidase domain, a transmembrane
region and a NADPH binding site located on the cytosolic face
of the plasma membrane. By analogy with the neutrophil
NADPH-oxidase which generates extracellular superoxide,
human duox1 is predicted to generate superoxide and its
byproduct hydrogen peroxide extracellularly where it can be
utilized by the peroxidase domain.

While the ROI generated by duox1 and duox2 may
function as does mox1 in regulation of cell growth, the presence
of the peroxidase domain is likely to confer additional
biological functions. Depending upon the co-substrate,
peroxidases can participate in a variety of reactions including
halogenation such as the generation of hypochlorous acid
(HOCl) by myeloperoxidase and the iodination of tyrosine to
form thyroxin by thyroid peroxidase. Peroxidases have also
been documented to participate in the metabolism of
polyunsaturated fatty acids, and in the chemical modification of
tyrosine in collagen (by sea urchin ovoperoxidase). Although
not wanting to be bound by this statement, it is believed that the
predicted transmembrane nature of duox1 facilitates its function
in the formation or modification of extracellular matrix or
basement membrane. Since the extracellular matrix plays an
important role in tumor cell growth, invasion and metastasis, it
is believed that the duox type enzymes play a pathogenic role in
such conditions.

In addition to the nucleotide sequences described
above, the present invention also provides vectors containing
these nucleotide sequences and fragments thereof, cells
transfected with these vectors which produce the proteins
comprising SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:21, SEQ
ID NO:42, SEQ ID NO:46, SEQ ID NO:48 and fragments
thereof, and antibodies to these proteins and fragments thereof.
The present invention also provides methods for stimulating
cellular proliferation by administering vectors, or cells
containing vectors, encoded for production of the proteins
comprising SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:21, SEQ
ID NO:42, SEQ ID NO:46, SEQ ID NO:48 and fragments
thereof. The nucleotides and antibodies of the present invention
are useful for the detection, localization and measurement of the
nucleic acids encoding for the production of the proteins of the
present invention, and also for the detection, localization and
measurement of the proteins of the present invention. These
nucleotides and antibodies can be combined with other reagents
in kits for the purposes of detection, localization and
measurement. These kits are useful for diagnosis and prognosis
of conditions involving cellular proliferation associated with
production of reactive oxygen intermediates.

The present invention solves the problems
described above by providing a composition comprising the
nucleotide sequence SEQ ID NO:1 and fragments thereof. The
present invention also provides a composition comprising the
nucleotide sequence SEQ ID NO:3 and fragments thereof. The
present invention also provides a composition comprising the
nucleotide sequence SEQ ID NO:22 and fragments thereof. The
present invention also provides a composition comprising the
nucleotide sequence SEQ ID NO:41 and fragments thereof. The
present invention also provides a composition comprising the
nucleotide sequence SEQ ID NO:45 and fragments thereof. The
present invention also provides a composition comprising the
nucleotide sequence SEQ ID NO:47 and fragments thereof.

The present invention provides a composition
comprising the protein SEQ ID NO:2 encoded by the nucleotide
sequence SEQ ID NO:1. The present invention provides a
composition comprising the protein SEQ ID NO:4 encoded by
the nucleotide sequence SEQ ID NO:3. The present invention
provides a composition comprising the protein SEQ ID NO:21
encoded by the nucleotide sequence SEQ ID NO:22. The
present invention provides a composition comprising the protein
SEQ ID NO:42 encoded by the nucleotide sequence SEQ ID
NO:41. The present invention provides a composition
comprising the protein SEQ ID NO:46 encoded by the
nucleotide sequence SEQ ID NO:45. The present invention
provides a composition comprising the protein SEQ ID NO:48
encoded by the nucleotide sequence SEQ ID NO:47.

The present invention provides a composition
comprising the protein SEQ ID NO:2 or fragments thereof,
encoded by the nucleotide sequence SEQ ID NO:1 or fragments
thereof. The present invention also provides a composition
comprising the protein SEQ ID NO:4 or fragments thereof,
encoded by the nucleotide sequence SEQ ID NO:3 or fragments
thereof. The present invention also provides a composition
comprising the protein SEQ ID NO:21 or fragments thereof,
encoded by the nucleotide sequence SEQ ID NO:22 or
fragments thereof. The present invention also provides a
composition comprising the protein SEQ ID NO:42 or
fragments thereof, encoded by the nucleotide sequence SEQ ID
NO:41 or fragments thereof. The present invention also
provides a composition comprising the protein SEQ ID NO:46
or fragments thereof, encoded by the nucleotide sequence SEQ
ID NO:45 or fragments thereof. The present invention also
provides a composition comprising the protein SEQ ID NO:48
or fragments thereof, encoded by the nucleotide sequence SEQ
ID NO:47 or fragments thereof.

The present invention also provides vectors
containing the nucleotide sequences SEQ ID NO:1, SEQ ID
NO:3, SEQ ID NO:22, SEQ ID NO:41, SEQ ID NO:45, SEQ
ID NO:47 or fragments thereof. The present invention also
provides cells transfected with these vectors. In addition, the
present invention provides cells stably transfected with the
nucleotide sequence SEQ ID NO:1 or fragments thereof. The
present invention also provides cells stably transfected with the
nucleotide sequence SEQ ID NO:3 or fragments thereof. The
present invention also provides cells stably transfected with the
nucleotide sequence SEQ ID NO:22 or fragments thereof. The
present invention also provides cells stably transfected with the
nucleotide sequence SEQ ID NO:41 or fragments thereof. The
present invention also provides cells stably transfected with the
nucleotide sequence SEQ ID NO:45 or fragments thereof. The
present invention also provides cells stably transfected with the
nucleotide sequence SEQ ID NO:47 or fragments thereof.

The present invention provides cells stably
transfected with the nucleotide sequence SEQ ID NO:1 or
fragments thereof, which produce the protein SEQ ID NO:2 or
fragments thereof. In addition, the present invention provides
cells stably transfected with the nucleotide sequence SEQ ID
NO:3 or fragments thereof which produce the protein SEQ ID
NO:4 or fragments thereof. In addition, the present invention
provides cells stably transfected with the nucleotide sequence
SEQ ID NO:22 or fragments thereof which produce the protein
SEQ ID NO:21 or fragments thereof. The present invention
also provides cells stably transfected with the nucleotide
sequence SEQ ID NO:41 or fragments thereof which produce
the protein SEQ ID NO:42 or fragments thereof. The present
invention also provides cells stably transfected with the
nucleotide sequence SEQ ID NO:45 or fragments thereof which
produce the protein SEQ ID NO:46 or fragments thereof. The
present invention also provides cells stably transfected with the
nucleotide sequence SEQ ID NO:47 or fragments thereof which
produce the protein SEQ ID NO:48 or fragments thereof.

The present invention provides a method for
stimulating growth by administering cells stably transfected
with the nucleotide sequence SEQ ID NO:1 which produce the
protein SEQ ID NO:2 or fragments thereof. The present
invention also provides a method for stimulating growth by
administering cells stably transfected with the nucleotide
sequence SEQ ID NO:3 or fragments thereof, which produce
the protein SEQ ID NO:4 or fragments thereof. The present
invention also provides a method for stimulating growth by
administering cells stably transfected with the nucleotide
sequence SEQ ID NO:22 or fragments thereof, which produce
the protein SEQ ID NO:21 or fragments thereof. The present
invention also provides a method for stimulating growth by
administering cells stably transfected with the nucleotide
sequence SEQ ID NO:41 or fragments thereof, which produce
the protein SEQ ID NO:42 or fragments thereof. The present
invention also provides a method for stimulating growth by
administering cells stably transfected with the nucleotide
sequence SEQ ID NO:45 or fragments thereof, which produce
the protein SEQ ID NO:46 or fragments thereof. The present
invention also provides a method for stimulating growth by
administering cells stably transfected with the nucleotide
sequence SEQ ID NO:47 or fragments thereof, which produce
the protein SEQ ID NO:48 or fragments thereof.

Specifically, the present invention provides a
method for stimulating tumor formation by administering cells
stably transfected with the nucleotide sequence SEQ ID NO:1 or
fragments thereof, which produce the protein SEQ ID NO:2 or
fragments thereof. The present invention also provides a
method for stimulating tumor formation by administering cells
stably transfected with the nucleotide sequence SEQ ID NO:3 or
fragments thereof, which produce the protein SEQ ID NO:4 or
fragments thereof. The present invention also provides a
method for stimulating tumor formation by administering cells
stably transfected with the nucleotide sequence SEQ ID NO:22
or fragments thereof, which produce the protein SEQ ID NO:21
or fragments thereof. The present invention also provides a
method for stimulating tumor formation by administering cells
stably transfected with the nucleotide sequence SEQ ID NO:41
or fragments thereof, which produce the protein SEQ ID NO:42
or fragments thereof. The present invention also provides a
method for stimulating tumor formation by administering cells
stably transfected with the nucleotide sequence SEQ ID NO:45
or fragments thereof, which produce the protein SEQ ID NO:46
or fragments thereof. The present invention also provides a
method for stimulating tumor formation by administering cells
stably transfected with the nucleotide sequence SEQ ID NO:47
or fragments thereof, which produce the protein SEQ ID NO:48
or fragments thereof.

The present invention may also be used to develop
anti-sense nucleotide sequences to SEQ ID NO:1, SEQ ID NO:3,
SEQ ID NO:22, SEQ ID NO:41, SEQ ID NO:45, SEQ ID
NO:47 or fragments thereof. These anti-sense molecules may
be used to interfere with translation of nucleotide sequences,
such as SEQ ID NO:1, SEQ ID NO:3, SEQ ID NO:22, SEQ ID
NO:41, SEQ ID NO:45, SEQ ID NO:47, or fragments thereof,
that encode for proteins such as SEQ ID NO:2, SEQ ID NO:4,
SEQ ID NO:21, SEQ ID NO:42, SEQ ID NO:46, SEQ ID
NO:48 or fragments thereof. Administration of these anti-sense
molecules, or vectors encoding for anti-sense molecules, to
humans and animals, would interfere with production of
proteins such as SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:21,
SEQ ID NO:42, SEQ ID NO:46, SEQ ID NO:48, or fragments
thereof, thereby decreasing production of ROIs and inhibiting
cellular proliferation. These methods are useful in producing
animal models for use in study of tumor development and
vascular growth, and for study of the efficacy of treatments for
affecting tumor and vascular growth in vivo.
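
As an illustration of the anti-sense approach described above, the following minimal sketch derives an anti-sense oligonucleotide as the reverse complement of a window of a sense-strand DNA sequence. The function names and the example sequence are hypothetical and are not taken from any SEQ ID NO of this application; oligonucleotide length and target-site selection are left to the practitioner.

```python
# Illustrative sketch only: the helper names and the example sequence are
# assumptions made for this example, not part of the disclosed sequences.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def antisense(sense: str) -> str:
    """Return the reverse complement of a DNA sense strand, written 5'->3'."""
    return sense.translate(_COMPLEMENT)[::-1]


def antisense_window(sense: str, start: int, length: int) -> str:
    """Anti-sense oligo against sense[start:start+length] (0-based start)."""
    if start < 0 or length <= 0 or start + length > len(sense):
        raise ValueError("window falls outside the sequence")
    return antisense(sense[start:start + length])


# Placeholder sense-strand fragment (NOT a SEQ ID NO from this application).
example_sense = "ATGGGCAACTGGGTG"
example_oligo = antisense_window(example_sense, 0, 9)
```

Applying `antisense` twice returns the original sense strand, which is a convenient sanity check when preparing oligonucleotide orders.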

The present invention also provides a method for
high throughput screening of drugs and chemicals which
modulate the proliferative activity of the enzymes of the present
invention, thereby affecting cell division. Combinatorial
chemical libraries may be screened for chemicals which
modulate the proliferative activity of these enzymes. Drugs and
chemicals may be evaluated based on their ability to modulate
the enzymatic activity of the expressed or endogenous proteins,
including those represented by SEQ ID NO:2, SEQ ID NO:4,
SEQ ID NO:21, SEQ ID NO:42, SEQ ID NO:46, SEQ ID
NO:48 or fragments thereof. Endogenous proteins may be
obtained from many different tissues or cells, such as colon
cells. Drugs may also be evaluated based on their ability to bind
to the expressed or endogenous proteins represented by SEQ ID
NO:2, SEQ ID NO:4, SEQ ID NO:21, SEQ ID NO:42, SEQ ID
NO:46, SEQ ID NO:48 or fragments thereof. Enzymatic
activity may be NADPH- or NADH-dependent superoxide
generation catalyzed by the holoprotein. Enzymatic activity
may also be NADPH- or NADH-dependent diaphorase activity
catalyzed by either the holoprotein or the flavoprotein domain.

By flavoprotein domain is meant approximately
the C-terminal half of the enzymes shown in SEQ ID NO:2,
SEQ ID NO:4, SEQ ID NO:21, SEQ ID NO:42, or fragments
thereof, and the C-terminal end of the enzymes shown in SEQ
ID NO:46 and SEQ ID NO:48 (approximately the C-terminal
265 amino acids). This fragment of gp91phox has
NADPH-dependent reductase activity towards cytochrome c,
nitroblue tetrazolium and other dyes. Expressed proteins or
fragments thereof can be used for robotic screens of existing
combinatorial chemical libraries. While not wanting to be
bound by the following statement, it is believed that the NADPH
or NADH binding site and the FAD binding site are useful for
evaluating the ability of drugs and other compositions to bind to
the mox and duox enzymes or to modulate their enzymatic
activity. The use of the holoprotein or the C-terminal half or
end regions is preferred for developing a high throughput
drug screen. Additionally, the N-terminal one-third of the duox
domain (the peroxidase domain) may also be used to evaluate
the ability of drugs and other compositions to inhibit the
peroxidase activity, and for further development of a high
throughput drug screen.
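
The approximate domain boundaries given above (C-terminal half for the mox enzymes, roughly the C-terminal 265 amino acids for the duox enzymes, and the N-terminal one-third of duox for the peroxidase domain) can be turned into fragment coordinates as in the following sketch. The function names are hypothetical, the cut points are only the rough fractions stated in the text, and real construct boundaries would need to be chosen from the actual sequences.

```python
from typing import Optional


def flavoprotein_domain(protein: str, c_terminal_residues: Optional[int] = None) -> str:
    """Approximate flavoprotein domain of a protein sequence.

    With no residue count, return roughly the C-terminal half (mox-type
    enzymes). With a count (e.g. 265 for duox-type enzymes), return that
    many C-terminal residues.
    """
    if c_terminal_residues is None:
        return protein[len(protein) // 2:]
    if not 0 < c_terminal_residues <= len(protein):
        raise ValueError("residue count must be between 1 and the protein length")
    return protein[-c_terminal_residues:]


def peroxidase_domain(protein: str) -> str:
    """Approximate peroxidase domain: the N-terminal one-third (duox-type)."""
    return protein[: len(protein) // 3]
```

For example, `flavoprotein_domain(seq, 265)` would give the candidate C-terminal fragment of a duox sequence for a diaphorase-based screen, under the approximation stated above.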

The present invention also provides antibodies
directed to the proteins SEQ ID NO:2, SEQ ID NO:4, SEQ ID
NO:21, SEQ ID NO:42, SEQ ID NO:46, SEQ ID NO:48 and
fragments thereof. The antibodies of the present invention are
useful for a variety of purposes including localization, detection
and measurement of the proteins SEQ ID NO:2, SEQ ID NO:4,
SEQ ID NO:21, SEQ ID NO:42, SEQ ID NO:46, SEQ ID
NO:48 and fragments thereof. The antibodies may be employed
in kits to accomplish these purposes. These antibodies may also
be linked to cytotoxic agents for selected killing of cells. The
term antibody is meant to include any class of antibody such as
IgG, IgM and other classes. The term antibody also includes a
completely intact antibody and also fragments thereof, including
but not limited to Fab fragments and Fab + Fc fragments.

The present invention also provides the nucleotide
sequences SEQ ID NO:1, SEQ ID NO:3, SEQ ID NO:22, SEQ
ID NO:41, SEQ ID NO:45, SEQ ID NO:47 and fragments
thereof. These nucleotides are useful for a variety of purposes
including localization, detection, and measurement of messenger
RNA involved in synthesis of the proteins represented as SEQ
ID NO:2, SEQ ID NO:4, SEQ ID NO:21, SEQ ID NO:42, SEQ
ID NO:46, SEQ ID NO:48 and fragments thereof. These
nucleotides may also be used in the construction of labeled
probes for the localization, detection, and measurement of
nucleic acids such as messenger RNA or alternatively for the
isolation of larger nucleotide sequences containing the
nucleotide sequences shown in SEQ ID NO:1, SEQ ID NO:3,
SEQ ID NO:22, SEQ ID NO:41, SEQ ID NO:45, SEQ ID
NO:47 or fragments thereof. These nucleotide sequences may
be used to isolate homologous strands from other species using
techniques known to one of ordinary skill in the art. These
nucleotide sequences may also be used to make probes and
complementary strands. In particular, the nucleotide sequence
shown in SEQ ID NO:47 may be used to isolate the complete
coding sequence for duox2. The nucleotides may be employed
in kits to accomplish these purposes.

Most particularly, the present invention involves a
method for modulation of growth by modifying the proteins
represented as SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:21,
SEQ ID NO:42, SEQ ID NO:46, SEQ ID NO:48 or fragments
thereof.

The term "mitogenic regulators" is used herein to 
mean any molecule that acts to affect cell division. 

The term "animal" is used herein to mean humans
and non-human animals of both sexes.

The terms "a", "an" and "the" as used herein are
defined to mean one or more and include the plural unless the
context is inappropriate.

"Proteins", "peptides," "polypeptides" and
"oligopeptides" are chains of amino acids (typically L-amino
acids) whose alpha carbons are linked through peptide bonds
formed by a condensation reaction between the carboxyl group
of the alpha carbon of one amino acid and the amino group of
the alpha carbon of another amino acid. The terminal amino
acid at one end of the chain (i.e., the amino terminal) has a free
amino group, while the terminal amino acid at the other end of
the chain (i.e., the carboxy terminal) has a free carboxyl group.
As such, the term "amino terminus" (abbreviated N-terminus)
refers to the free alpha-amino group on the amino acid at the
amino terminal of the protein, or to the alpha-amino group
(imino group when participating in a peptide bond) of an amino
acid at any other location within the protein. Similarly, the
term "carboxy terminus" (abbreviated C-terminus) refers to the
free carboxyl group on the amino acid at the carboxy terminus
of a protein, or to the carboxyl group of an amino acid at any
other location within the protein.

Typically, the amino acids making up a protein are
numbered in order, starting at the amino terminal and
increasing in the direction toward the carboxy terminal of the
protein. Thus, when one amino acid is said to "follow" another,
that amino acid is positioned closer to the carboxy terminal of
the protein than the preceding amino acid.

The term "residue" is used herein to refer to an
amino acid (D or L) or an amino acid mimetic that is
incorporated into a protein by an amide bond. As such, the
amino acid may be a naturally occurring amino acid or, unless
otherwise limited, may encompass known analogs of natural
amino acids that function in a manner similar to the naturally
occurring amino acids (i.e., amino acid mimetics). Moreover,
an amide bond mimetic includes peptide backbone modifications
well known to those skilled in the art.

Furthermore, one of skill will recognize that, as
mentioned above, individual substitutions, deletions or additions
which alter, add or delete a single amino acid or a small
percentage of amino acids (typically less than 5%, more
typically less than 1%) in an encoded sequence are
conservatively modified variations where the alterations result
in the substitution of an amino acid with a chemically similar
amino acid. Conservative substitution tables providing
functionally similar amino acids are well known in the art. The
following six groups each contain amino acids that are
conservative substitutions for one another:

1) Alanine (A), Serine (S), Threonine (T);

2) Aspartic acid (D), Glutamic acid (E);

3) Asparagine (N), Glutamine (Q);

4) Arginine (R), Lysine (K);

5) Isoleucine (I), Leucine (L), Methionine (M), Valine
(V); and

6) Phenylalanine (F), Tyrosine (Y), Tryptophan (W).
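
The six groups listed above, together with the "typically less than 5%" guideline, can be expressed as a simple check on an aligned reference/variant pair. The sketch below is illustrative only: the function names are hypothetical, it handles substitutions in equal-length sequences (not insertions or deletions), and the example peptides are placeholders rather than sequences from this application.

```python
# The six conservative substitution groups listed in the text.
CONSERVATIVE_GROUPS = [
    set("AST"),   # 1) Alanine, Serine, Threonine
    set("DE"),    # 2) Aspartic acid, Glutamic acid
    set("NQ"),    # 3) Asparagine, Glutamine
    set("RK"),    # 4) Arginine, Lysine
    set("ILMV"),  # 5) Isoleucine, Leucine, Methionine, Valine
    set("FYW"),   # 6) Phenylalanine, Tyrosine, Tryptophan
]


def is_conservative(original: str, replacement: str) -> bool:
    """True if replacing `original` with `replacement` stays within one group."""
    a, b = original.upper(), replacement.upper()
    if a == b:
        return False  # identical residues are not a substitution
    return any(a in group and b in group for group in CONSERVATIVE_GROUPS)


def summarize_substitutions(reference: str, variant: str):
    """Return (substitutions, conservative substitutions, fraction changed)."""
    if len(reference) != len(variant):
        raise ValueError("this sketch only compares equal-length sequences")
    if not reference:
        raise ValueError("sequences must not be empty")
    changed = [(r, v) for r, v in zip(reference, variant) if r != v]
    conservative = sum(1 for r, v in changed if is_conservative(r, v))
    return len(changed), conservative, len(changed) / len(reference)
```

A variant would fall within the guideline quoted above when every substitution is conservative and the fraction changed is below 0.05 (or below 0.01 for the stricter figure).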

When the peptides are relatively short in length
(i.e., less than about 50 amino acids), they are often synthesized
using standard chemical peptide synthesis techniques. Solid
phase synthesis in which the C-terminal amino acid of the
sequence is attached to an insoluble support followed by
sequential addition of the remaining amino acids in the sequence
is a preferred method for the chemical synthesis of the antigenic
epitopes described herein. Techniques for solid phase synthesis
are known to those skilled in the art.

Alternatively, the antigenic epitopes described
herein are synthesized using recombinant nucleic acid
methodology. Generally, this involves creating a nucleic acid
sequence that encodes the peptide or protein, placing the nucleic
acid in an expression cassette under the control of a particular
promoter, expressing the peptide or protein in a host, isolating
the expressed peptide or protein and, if required, renaturing the
peptide or protein. Techniques sufficient to guide one of skill
through such procedures are found in the literature.

When several desired protein fragments or peptides 
are encoded in the nucleotide sequence incorporated into a 
vector, one of skill in the art will appreciate that the protein 
fragments or peptides may be separated by a spacer molecule 
such as, for example, a peptide, consisting of one or more 
amino acids. Generally, the spacer will have no specific 
biological activity other than to join the desired protein 
fragments or peptides together, or to preserve some minimum 
distance or other spatial relationship between them. However, 
the constituent amino acids of the spacer may be selected to 
influence some property of the molecule such as the folding, net 
charge, or hydrophobicity. Nucleotide sequences encoding for 
the production of residues which may be useful in purification 
of the expressed recombinant protein may be built into the 
vector. Such sequences are known in the art. For example, a 
nucleotide sequence encoding for a poly histidine sequence may 
be added to a vector to facilitate purification of the expressed 
recombinant protein on a nickel column. 
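
As a sketch of how a nucleotide sequence encoding a poly histidine tag might be added in frame to a coding sequence, the following hypothetical helper appends histidine codons to the 3' end, placing them before a terminal stop codon if one is present. The function name, the choice of the CAC codon and the default of six histidines are assumptions for illustration; an actual construct would also need to account for the vector's cloning sites and any spacer residues.

```python
HIS_CODON = "CAC"                    # histidine is encoded by CAT or CAC
STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard DNA stop codons


def add_his_tag(coding_sequence: str, n_his: int = 6) -> str:
    """Append n_his histidine codons in frame at the 3' end of a coding sequence.

    If the sequence ends with a stop codon, the tag is inserted before it so
    that the tag is translated as part of the protein.
    """
    cds = coding_sequence.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    if n_his <= 0:
        raise ValueError("n_his must be positive")
    tag = HIS_CODON * n_his
    if cds[-3:] in STOP_CODONS:
        return cds[:-3] + tag + cds[-3:]
    return cds + tag
```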

Once expressed, recombinant peptides, polypeptides 
and proteins can be purified according to standard procedures 
known to one of ordinary skill in the art, including ammonium 
sulfate precipitation, affinity columns, column chromatography,
gel electrophoresis and the like. Substantially pure 
compositions of about 50 to 99% homogeneity are preferred, 
and 80 to 95% or greater homogeneity are most preferred for 
use as therapeutic agents. 

One of skill in the art will recognize that after 
chemical synthesis, biological expression or purification, the 
desired proteins, fragments thereof and peptides may possess a 
conformation substantially different than the native 
conformations of the proteins, fragments thereof and peptides. 
In this case, it is often necessary to denature and reduce protein 
and then to cause the protein to re-fold into the preferred 
conformation. Methods of reducing and denaturing proteins
and inducing re-folding are well known to those of skill in the
art.

The genetic constructs of the present invention
include coding sequences for different proteins, fragments
thereof, and peptides. The genetic constructs also include
epitopes or domains chosen to permit purification or detection
of the expressed protein. Such epitopes or domains include
DNA sequences encoding the glutathione binding domain from
glutathione S-transferase, hexa-histidine, thioredoxin,
hemagglutinin antigen, maltose binding protein, and others
commonly known to one of skill in the art. The preferred
genetic construct includes the nucleotide sequences of SEQ ID
NO:1, SEQ ID NO:3, SEQ ID NO:22, SEQ ID NO:41, SEQ ID
NO:45, SEQ ID NO:47 or fragments thereof. It is to be
understood that additional or alternative nucleotide sequences
may be included in the genetic constructs in order to encode for
the following: a) multiple copies of the desired proteins,
fragments thereof, or peptides; b) various combinations of the
desired proteins, fragments thereof, or peptides; and c)
conservative modifications of the desired proteins, fragments
thereof, or peptides, and combinations thereof. Preferred
proteins include the human mox1 protein and human mox2
protein shown as SEQ ID NO:2 and SEQ ID NO:4, respectively,
and fragments thereof. Some preferred fragments of the human
mox1 protein (SEQ ID NO:2) include but are not limited to the
proteins shown as SEQ ID NO:23, SEQ ID NO:24, and SEQ ID
NO:25. The protein mox1 is also called p65mox in this
application. Another preferred protein of the present invention
is rat mox1 protein shown as SEQ ID NO:21 and fragments
thereof. Another preferred protein of the present invention is
rat mox1B protein shown as SEQ ID NO:42 and fragments
thereof. Yet another preferred protein of the present invention
is duox1 protein shown as SEQ ID NO:46 and fragments
thereof. Still another preferred protein of the present invention
is duox2 protein. A partial amino acid sequence of the duox2
protein is shown as SEQ ID NO:48.

The nucleotide sequences of the present invention
may also be employed to hybridize to nucleic acids such as DNA
or RNA nucleotide sequences under high stringency conditions
which permit detection, for example, of alternately spliced
messages.

The genetic construct is expressed in an expression
system such as in NIH 3T3 cells using recombinant sequences in
a pcDNA-3 vector (Invitrogen, Carlsbad, CA) to produce a
recombinant protein. Preferred expression systems include but
are not limited to Cos-7 cells, insect cells using recombinant
baculovirus, and yeast. It is to be understood that other
expression systems known to one of skill in the art may be used
for expression of the genetic constructs of the present invention.
The preferred proteins of the present invention are the proteins
referred to herein as human mox1 and human mox2 or
fragments thereof which have the amino acid sequences set forth
in SEQ ID NO:2 and SEQ ID NO:4, respectively, or an amino
acid sequence having amino acid substitutions as defined in the
definitions that do not significantly alter the function of the
recombinant protein in an adverse manner. Another preferred
protein of the present invention is referred to herein as rat
mox1 and has the amino acid sequence set forth in SEQ ID
NO:21. Yet another preferred protein of the present invention
is referred to herein as rat mox1B and has the amino acid
sequence set forth in SEQ ID NO:42. Two other preferred
proteins of the present invention are referred to herein as
human duox1 and human duox2, or fragments thereof, which
have the amino acid sequences set forth in SEQ ID NO:46 and
SEQ ID NO:48, respectively, or an amino acid sequence having
amino acid substitutions as defined in the definitions that do not
significantly alter the function of the recombinant protein in an
adverse manner.

Terminology 

It should be understood that some of the
terminology used to describe the novel mox and duox proteins
contained herein is different from the terminology in U.S.
Provisional Application Serial No. 60/107,911 and U.S.
Provisional Application Serial No. 60/149,332 upon which this
application claims priority in part. As described herein, the
term "human mox1" refers to a protein comprising an amino
acid sequence as set forth in SEQ ID NO:2, or a fragment
thereof, and encoded by the nucleotide sequence as set forth in
SEQ ID NO:1, or a fragment thereof. As described herein, the
term "human mox2" refers to a protein comprising an amino
acid sequence as set forth in SEQ ID NO:4, or a fragment
thereof, and encoded by the nucleotide sequence as set forth in
SEQ ID NO:3, or a fragment thereof. As described herein, the
term "human duox1" refers to a protein comprising an amino
acid sequence as set forth in SEQ ID NO:46, or a fragment
thereof, and encoded by the nucleotide sequence as set forth in
SEQ ID NO:45, or a fragment thereof. As described herein,
the term "human duox2" refers to a protein comprising an
amino acid sequence as set forth in SEQ ID NO:48, or a
fragment thereof, and encoded by the nucleotide sequence as set
forth in SEQ ID NO:47, or a fragment thereof.
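
For readers cross-referencing the sequence listing, the nucleotide/protein pairings used throughout this description can be collected in a small lookup table, sketched below. The pairings for the four human proteins follow the definitions above; the rat mox1 and rat mox1B entries follow the pairings stated elsewhere in this description. The type and function names are hypothetical conveniences, not part of the disclosure.

```python
from typing import NamedTuple


class SeqPair(NamedTuple):
    name: str
    nucleotide_seq_id: int  # SEQ ID NO of the encoding nucleotide sequence
    protein_seq_id: int     # SEQ ID NO of the encoded protein


SEQ_PAIRS = [
    SeqPair("human mox1", 1, 2),
    SeqPair("human mox2", 3, 4),
    SeqPair("rat mox1", 22, 21),
    SeqPair("rat mox1B", 41, 42),
    SeqPair("human duox1", 45, 46),
    SeqPair("human duox2 (partial sequence)", 47, 48),
]


def protein_for_nucleotide(nucleotide_seq_id: int) -> int:
    """Return the protein SEQ ID NO encoded by a nucleotide SEQ ID NO."""
    for pair in SEQ_PAIRS:
        if pair.nucleotide_seq_id == nucleotide_seq_id:
            return pair.protein_seq_id
    raise KeyError(f"no protein paired with SEQ ID NO:{nucleotide_seq_id}")
```

Note that the rat mox1 pair is the only one in which the protein carries the lower number (SEQ ID NO:21 encoded by SEQ ID NO:22).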

Construction of the Recombinant Gene 

The desired gene is ligated into a transfer vector,
such as pcDNA3, and the recombinants are used to transform
host cells such as Cos-7 cells. It is to be understood that
different transfer vectors, host cells, and transfection methods
may be employed as commonly known to one of ordinary skill
in the art. Six desired genes for use in transfection are shown
in SEQ ID NO:1, SEQ ID NO:3, SEQ ID NO:22, SEQ ID
NO:41, SEQ ID NO:45 and SEQ ID NO:47. For example,
lipofectamine-mediated transfection and in vivo homologous
recombination were used to introduce the mox1 gene into NIH
3T3 cells.

The synthetic gene is cloned and the recombinant
construct containing mox or duox gene is produced and grown
in confluent monolayer cultures of a Cos-7 cell line. The
expressed recombinant protein is then purified, preferably using
affinity chromatography techniques, and its purity and
specificity determined by known methods.

A variety of expression systems may be employed
for expression of the recombinant protein. Such expression
methods include, but are not limited to the following: bacterial
expression systems, including those utilizing E. coli and Bacillus
subtilis; virus systems; yeast expression systems; cultured insect
and mammalian cells; and other expression systems known to
one of ordinary skill in the art.

Transfection of Cells 

It is to be understood that the vectors of the present
invention may be transfected into any desired cell or cell line.
Both in vivo and in vitro transfection of cells are contemplated
as part of the present invention. Preferred cells for transfection
include but are not limited to the following: fibroblasts
(possibly to enhance wound healing and skin formation),
granulocytes (possible benefit to increase function in a
compromised immune system as seen in AIDS, and aplastic
anemia), muscle cells, neuroblasts, stem cells, bone marrow
cells, osteoblasts, B lymphocytes, and T lymphocytes.

Cells may be transfected with a variety of methods
known to one of ordinary skill in the art and include but are not
limited to the following: electroporation, gene gun, calcium
phosphate, lipofectamine, and fugene, as well as adenoviral
transfection systems.

Host cells transfected with the nucleic acids
represented in SEQ ID NO:1, SEQ ID NO:3, SEQ ID NO:22,
SEQ ID NO:41, SEQ ID NO:45 and SEQ ID NO:47, or
fragments thereof, are used to express the proteins SEQ ID
NO:2, SEQ ID NO:4, SEQ ID NO:21, SEQ ID NO:42, SEQ ID
NO:46 and SEQ ID NO:48, respectively, or fragments thereof.

These expressed proteins are used to raise
antibodies. These antibodies may be used for a variety of
applications including but not limited to immunotherapy against
cancers expressing one of the mox or duox proteins, and for
detection, localization and measurement of the proteins shown
in SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:21, SEQ ID
NO:42, SEQ ID NO:46 or SEQ ID NO:48 or fragments thereof.

Purification and Characterization of the Expressed Protein 

The proteins of the present invention can be 
expressed as a fusion protein with a poly histidine component, 
such as a hexa histidine, and purified by binding to a metal 
affinity column using nickel or cobalt affinity matrices. The
protein can also be expressed as a fusion protein with 
glutathione S-transferase and purified by affinity 
chromatography using a glutathione agarose matrix. The 
protein can also be purified by immunoaffinity chromatography 
by expressing it as a fusion protein, for example with 
hemagglutinin antigen. The expressed or naturally occurring 
protein can also be purified by conventional chromatographic 
and purification methods which include anion and cation 
exchange chromatography, gel exclusion chromatography, 
hydroxylapatite chromatography, dye binding chromatography, 
ammonium sulfate precipitation, precipitation in organic 
solvents or other techniques commonly known to one of skill in 
the art. 

Methods of Assessing Activity of Expressed Proteins 

Different methods are available for assessing the 
activity of the expressed proteins of the present invention, 
including but not limited to the proteins represented as SEQ ID 
NO:2, SEQ ID NO:4, SEQ ID NO:21, SEQ ID NO:42, SEQ ID 
NO:46 or SEQ ID NO:48, substituted analogs thereof, and
fragments thereof. 

1. Assays of the holoprotein and fragments
thereof for superoxide generation

A. General considerations.

These assays are useful in assessing efficacy of
drugs designed to modulate the activity of the enzymes of the
present invention. The holoprotein may be expressed in COS-7
cells, NIH 3T3 cells, insect cells (using baculoviral technology)
or other cells using methods known to one of skill in the art.
Membrane fractions or purified protein are used for the assay.
The assay may require or be augmented by other cellular
proteins such as p47phox, p67phox, and Rac1, as well as
potentially other unidentified factors (e.g., kinases or other
regulatory proteins).

B. Cytochrome c reduction. 

NADPH or NADH is used as the reducing substrate,
in a concentration of about 100 µM. Reduction of cytochrome c
is monitored spectrophotometrically by the increase in
absorbance at 550 nm, assuming an extinction coefficient of 21
mM⁻¹cm⁻¹. The assay is performed in the absence and
presence of about 10 µg superoxide dismutase. The superoxide-
dependent reduction is defined as cytochrome c reduction in the
absence of superoxide dismutase minus that in the presence of
superoxide dismutase (Uhlinger et al. (1991) J. Biol. Chem.
266, 20990-20997). Acetylated cytochrome c may also be used,
since the reduction of acetylated cytochrome c is thought to be
exclusively via superoxide.
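
The subtraction described above can be illustrated with a short calculation. The following is a minimal sketch only, assuming a 1 cm path length and a 1 mL reaction volume (neither is stated in the text); the function names and the example slopes are hypothetical and are not measured data.

```python
# Extinction coefficient quoted in the text for cytochrome c reduction at 550 nm.
EPSILON_CYT_C = 21.0  # mM^-1 cm^-1


def cyt_c_rate_nmol_per_min(slope_a550_per_min, path_cm=1.0, volume_ml=1.0):
    """Convert an absorbance slope (A550 per minute) to nmol reduced per minute.

    path_cm and volume_ml are assumptions; the text does not specify them.
    """
    conc_mm_per_min = slope_a550_per_min / (EPSILON_CYT_C * path_cm)  # mM/min
    # mM x mL = micromol, so multiply by 1000 to express the result in nmol.
    return conc_mm_per_min * volume_ml * 1000.0


def superoxide_dependent_rate(slope_minus_sod, slope_plus_sod,
                              path_cm=1.0, volume_ml=1.0):
    """Rate without superoxide dismutase minus rate with superoxide dismutase."""
    without_sod = cyt_c_rate_nmol_per_min(slope_minus_sod, path_cm, volume_ml)
    with_sod = cyt_c_rate_nmol_per_min(slope_plus_sod, path_cm, volume_ml)
    return without_sod - with_sod


if __name__ == "__main__":
    # Hypothetical slopes chosen only to show the arithmetic.
    print(superoxide_dependent_rate(0.042, 0.0105))
```

Only the difference between the two rates is attributed to superoxide; reduction that persists in the presence of superoxide dismutase is treated as superoxide-independent.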

C. Nitroblue tetrazolium reduction. 

For nitroblue tetrazolium (NBT) reduction, the
same general protocol is used, except that NBT is used in place
of cytochrome c. In general, about 1 mL of filtered 0.25%
nitrotetrazolium blue (Sigma, St. Louis, MO) is added in Hanks
buffer without or with about 600 Units of superoxide dismutase
(Sigma) and samples are incubated at approximately 37°C. The
oxidized NBT is clear, while the reduced NBT is blue and
insoluble. The insoluble product is collected by centrifugation,
and the pellet is re-suspended in about 1 mL of pyridine
(Sigma) and heated for about 10 minutes at 100°C to solubilize
the reduced NBT. The concentration of reduced NBT is
determined by measuring the absorbance at 510 nm, using an
extinction coefficient of 11,000 M⁻¹cm⁻¹. Untreated wells are
used to determine cell number.
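
The conversion from absorbance to an amount of reduced NBT per cell number can be sketched as follows. This is an illustrative calculation only, assuming a 1 cm path length and the 1 mL pyridine volume mentioned above; the function names and example absorbance are hypothetical.

```python
# Extinction coefficient quoted in the text for reduced NBT at 510 nm.
EPSILON_NBT = 11000.0  # M^-1 cm^-1


def reduced_nbt_nmol(a510, volume_ml=1.0, path_cm=1.0):
    """Amount of reduced NBT (nmol) in the solubilized sample.

    path_cm is an assumption; volume_ml defaults to the ~1 mL of pyridine.
    """
    conc_molar = a510 / (EPSILON_NBT * path_cm)  # mol/L
    # mol/L x mL = mmol, and 1 mmol = 1e6 nmol.
    return conc_molar * volume_ml * 1e6


def nbt_nmol_per_million_cells(a510, cell_count, volume_ml=1.0, path_cm=1.0):
    """Normalize reduced NBT to 10^6 cells, using the count from untreated wells."""
    if cell_count <= 0:
        raise ValueError("cell_count must be positive")
    return reduced_nbt_nmol(a510, volume_ml, path_cm) / (cell_count / 1e6)


if __name__ == "__main__":
    # Hypothetical absorbance for a well containing 500,000 cells.
    print(nbt_nmol_per_million_cells(0.055, 500_000))
```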

D. Luminescence. 

Superoxide generation may also be monitored with
a chemiluminescence detection system utilizing lucigenin (bis-N-
methylacridinium nitrate, Sigma, St. Louis, MO). The sample
is mixed with about 100 µM NADPH (Sigma, St. Louis, MO)
and 10 µM lucigenin (Sigma, St. Louis, MO) in a volume of
about 150 µL Hanks solution. Luminescence is monitored in a
96-well plate using a LumiCounter (Packard, Downers Grove,
IL) for 0.5 second per reading at approximately 1 minute
intervals for a total of about 5 minutes; the highest stable value
in each data set is used for comparisons. As above, superoxide
dismutase is added to some samples to prove that the
luminescence arises from superoxide. A buffer blank is
subtracted from each reading (Ushio-Fukai et al. (1996) J. Biol.
Chem. 271, 23317-23321).
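
The handling of the luminescence readings can be sketched as follows. This is a minimal sketch under stated assumptions: the text does not define how a "stable" value is judged, so the plain maximum of the blank-corrected readings is used here as a stand-in, and all names and readings are hypothetical.

```python
def blank_corrected(readings, blank):
    """Subtract the buffer blank from each luminescence reading."""
    return [reading - blank for reading in readings]


def peak_signal(readings, blank):
    """Highest blank-corrected reading in one data set.

    Assumption: the maximum stands in for the "highest stable value",
    because no stability criterion is given in the text.
    """
    return max(blank_corrected(readings, blank))


def sod_inhibited_fraction(peak_without_sod, peak_with_sod):
    """Fraction of the signal abolished by superoxide dismutase."""
    if peak_without_sod <= 0:
        raise ValueError("peak_without_sod must be positive")
    return 1.0 - peak_with_sod / peak_without_sod


if __name__ == "__main__":
    # Hypothetical readings taken at about 1 minute intervals.
    sample = [1200, 1850, 1900, 1880, 1700]
    print(peak_signal(sample, blank=200))
    print(sod_inhibited_fraction(1700, 170))
```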

E. Assays in intact cells.

Assays for superoxide generation may be
performed using intact cells, for example, the mox-transfected
NIH 3T3 cells. In principle, any of the above assays can be used
to evaluate superoxide generation in intact cells. NBT reduction
is a preferred assay method.

2. Assays of truncated proteins comprised of
approximately the C-terminal 265 amino acid residues

While not wanting to be bound by the following
statement, the truncated protein comprised of approximately the
C-terminal 265 amino acid residues is not expected to generate
superoxide, and therefore, superoxide dismutase is not added in
assays of the truncated protein. Basically, a similar assay is
established and the superoxide-independent reduction of NBT,
cytochrome c, dichlorophenolindophenol, ferricyanide, or
another redox-active dye is examined.

Nucleotides and Nucleic Acid Probes

The nucleotide sequences SEQ ID NO:1, SEQ ID
NO:3, SEQ ID NO:22, SEQ ID NO:41, SEQ ID NO:45 and SEQ
ID NO:47, as well as fragments thereof and PCR primers
therefor, may be used, respectively, for localization, detection
and measurement of nucleic acids related to SEQ ID NO:1, SEQ
ID NO:3, SEQ ID NO:22, SEQ ID NO:41, SEQ ID NO:45 and
SEQ ID NO:47, as well as fragments thereof. The nucleotide
sequences SEQ ID NO:1 and SEQ ID NO:3 are also called the
human mox1 gene and the human mox2 gene in this application.
SEQ ID NO:22 is also known as the rat mox1 gene in this
application. SEQ ID NO:41 is also known as the rat mox1B
gene in this application. SEQ ID NO:45 is also known as the
human duox1 gene in this application. SEQ ID NO:47 is also
known as the human duox2 gene in this application.

The nucleotide sequences SEQ ID NO:1, SEQ ID
NO:3, SEQ ID NO:22, SEQ ID NO:41, SEQ ID NO:45 and SEQ
ID NO:47, as well as fragments thereof, may be used to create
probes to isolate larger nucleotide sequences containing the
nucleotide sequences SEQ ID NO:1, SEQ ID NO:3, SEQ ID
NO:22, SEQ ID NO:41, SEQ ID NO:45 and SEQ ID NO:47,
respectively. The nucleotide sequences SEQ ID NO:1, SEQ ID
NO:3, SEQ ID NO:22, SEQ ID NO:41, SEQ ID NO:45 and SEQ
ID NO:47, as well as fragments thereof, may also be used to
create probes to identify and isolate mox and duox proteins in
other species.

The nucleic acids described herein include
messenger RNA coding for production of SEQ ID NO:2, SEQ
ID NO:4, SEQ ID NO:21, SEQ ID NO:42, SEQ ID NO:46,
SEQ ID NO:48 and fragments thereof. Such nucleic acids
include but are not limited to cDNA probes. These probes may
be labeled in a variety of ways known to one of ordinary skill in
the art. Such methods include but are not limited to isotopic
and non-isotopic labeling. These probes may be used for in situ
hybridization for localization of nucleic acids such as mRNA
encoding for SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:21,
SEQ ID NO:42, SEQ ID NO:46, SEQ ID NO:48 and fragments
thereof. Localization may be performed using in situ
hybridization at both ultrastructural and light microscopic levels
of resolution using techniques known to one of ordinary skill in
the art.

These probes may also be employed to detect and 
quantitate nucleic acids and mRNA levels using techniques 
known to one of ordinary skill in the art including but not 
limited to solution hybridization. 

Antibody Production 

The proteins shown in SEQ ID NO:2, SEQ ID 
NO:4, SEQ ID NO:21, SEQ ID NO:42, SEQ ID NO:46, SEQ 
ID NO:48, or fragments thereof, are combined with a 
pharmaceutically acceptable carrier or vehicle to produce a 
pharmaceutical composition and administered to animals for the 
production of polyclonal antibodies using methods known to one 
of ordinary skill in the art. The preferred animals for antibody
production are rabbits and mice. Other animals may be
employed for immunization with these proteins or fragments
thereof. Such animals include, but are not limited to the
following: sheep, horses, pigs, donkeys, cows, monkeys and
rodents such as guinea pigs and rats. 

The terms "pharmaceutically acceptable carrier or
pharmaceutically acceptable vehicle" are used herein to mean
any liquid including but not limited to water or saline, oil, gel,
salve, solvent, diluent, fluid ointment base, liposome, micelle,
giant micelle, and the like, which is suitable for use in contact
with living animal or human tissue without causing adverse
physiological responses, and which does not interact with the
other components of the composition in a deleterious manner.

The pharmaceutical compositions may conveniently
be presented in unit dosage form and may be prepared by
conventional pharmaceutical techniques. Such techniques
include the step of bringing into association the active ingredient
and the pharmaceutical carrier(s) or excipient(s). In general,
the formulations are prepared by uniformly and intimately
bringing into association the active ingredient with liquid
carriers.

Formulations suitable for parenteral administration 
include aqueous and non-aqueous sterile injection solutions 
which may contain anti-oxidants, buffers, bacteriostats and 
solutes which render the formulation isotonic with the blood of 
the intended recipient; and aqueous and non-aqueous sterile 
suspensions which may include suspending agents and thickening
agents. The formulations may be presented in unit-dose or 
multi-dose containers, for example, sealed ampules and vials, 
and may be stored in a freeze-dried (lyophilized) condition 
requiring only the addition of the sterile liquid carrier, for 
example, water for injections, immediately prior to use. 
Extemporaneous injection solutions and suspensions may be 
prepared from sterile powders, granules and tablets commonly 
used by one of ordinary skill in the art. 

Preferred unit dosage formulations are those 
containing a dose or unit, or an appropriate fraction thereof, of 
the administered ingredient. It should be understood that in 
addition to the ingredients, particularly mentioned above, the 
formulations of the present invention may include other agents 
commonly used by one of ordinary skill in the art. 

The pharmaceutical composition may be
administered through different routes, such as oral, including
buccal and sublingual, rectal, parenteral, aerosol, nasal,
intramuscular, subcutaneous, intradermal, and topical. The
pharmaceutical composition of the present invention may be
administered in different forms, including but not limited to
solutions, emulsions and suspensions, microspheres, particles,
microparticles, nanoparticles, and liposomes. It is expected that
from about 1 to 7 dosages may be required per immunization
regimen. Initial injections may range from about 0.1 µg to 1
mg, with a preferred range of about 1 µg to 800 µg, and a more
preferred range of from approximately 25 µg to 500 µg.
Booster injections may range from 0.1 µg to 1 mg, with a
preferred range of approximately 1 µg to 800 µg, and a more
preferred range of about 10 µg to 500 µg.

The volume of administration will vary depending 
on the route of administration and the size of the recipient. For 
example, intramuscular injections may range from about 0.1 ml 
to 1.0 ml. 

The pharmaceutical composition may be stored at 
temperatures of from about 4°C to -100°C. The pharmaceutical
composition may also be stored in a lyophilized state at different 
temperatures including room temperature. The pharmaceutical 
composition may be sterilized through conventional means 
known to one of ordinary skill in the art. Such means include, 
but are not limited to filtration, radiation and heat. The 
pharmaceutical composition of the present invention may also 
be combined with bacteriostatic agents, such as thimerosal, to 
inhibit bacterial growth. 

Adjuvants 

A variety of adjuvants known to one of ordinary 
skill in the art may be administered in conjunction with the 
protein in the pharmaceutical composition. Such adjuvants 
include, but are not limited to the following: polymers, co- 
polymers such as polyoxyethylene-polyoxypropylene 
copolymers, including block co-polymers; polymer P1005; 
Freund's complete adjuvant (for animals); Freund's incomplete 
adjuvant; sorbitan monooleate; squalene; CRL-8300 adjuvant; 
alum; QS 21, muramyl dipeptide; trehalose; bacterial extracts, 
including mycobacterial extracts; detoxified endotoxins; 
membrane lipids; or combinations thereof. 

Monoclonal antibodies can be produced using 
hybridoma technology in accordance with methods well known
to those skilled in the art. The antibodies are useful as research
or diagnostic reagents or can be used for passive immunization. 
The composition may optionally contain an adjuvant. 

The polyclonal and monoclonal antibodies useful as 
research or diagnostic reagents may be employed for detection 
and measurement of SEQ ID NO:2, SEQ ID NO:4, SEQ ID 
NO:21, SEQ ID NO:42, SEQ ID NO:46, SEQ ID NO:48 and 
fragments thereof. Such antibodies may be used to detect these 
proteins in a biological sample, including but not limited to
samples such as cells, cellular extracts, tissues, tissue extracts, 
biopsies, tumors, and biological fluids. Such detection 
capability is useful for detection of disease related to these 
proteins to facilitate diagnosis and prognosis and to suggest 
possible treatment alternatives.

Detection may be achieved through the use of
immunocytochemistry, ELISA, radioimmunoassay or other
assays as commonly known to one of ordinary skill in the art.
The mox1, mox2, duox1 and duox2 proteins, or fragments
thereof, may be labeled through commonly known approaches,
including but not limited to the following: radiolabeling, dyes,
magnetic particles, biotin-avidin, fluorescent molecules,
chemiluminescent molecules and systems, ferritin, colloidal
gold, and other methods known to one of skill in the art of
labeling proteins.

Administration of Antibodies 

The antibodies directed to the proteins shown as
SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:21, SEQ ID NO:42,
SEQ ID NO:46 or SEQ ID NO:48, or directed to fragments
thereof, may also be administered directly to humans and
animals in a passive immunization paradigm. Antibodies
directed to extracellular portions of SEQ ID NO:2, SEQ ID
NO:4, SEQ ID NO:21, SEQ ID NO:42, SEQ ID NO:46 or SEQ
ID NO:48 bind to these extracellular epitopes. Attachment of
labels to these antibodies facilitates localization and visualization
of sites of binding. Attachment of molecules such as ricin or
other cytotoxins to these antibodies helps to selectively damage
or kill cells expressing SEQ ID NO:2, SEQ ID NO:4, SEQ ID
NO:21, SEQ ID NO:42, SEQ ID NO:46, SEQ ID NO:48 or
fragments thereof.

Kits 

The present invention includes kits useful with the 
antibodies, nucleic acids, nucleic acid probes, labeled antibodies, 
labeled proteins or fragments thereof for detection, localization 
and measurement of SEQ ID NO:1, SEQ ID NO:2, SEQ ID
NO:3, SEQ ID NO:4, SEQ ID NO:21, SEQ ID NO:22, SEQ ID
NO:41, SEQ ID NO:42, SEQ ID NO:45, SEQ ID NO:46, SEQ 
ID NO:47, SEQ ID NO:48 or combinations and fragments 
thereof. 

Kits may be used for immunocytochemistry, in situ
hybridization, solution hybridization, radioimmunoassay,
ELISA, Western blots, quantitative PCR, and other assays for
the detection, localization and measurement of these nucleic
acids, proteins or fragments thereof using techniques known to
one of skill in the art.

The nucleotide sequences shown in SEQ ID NO:1,
SEQ ID NO:3, SEQ ID NO:22, SEQ ID NO:41, SEQ ID NO:45,
SEQ ID NO:47, or fragments thereof, may also be used under
high stringency conditions to detect alternately spliced messages
related to SEQ ID NO:1, SEQ ID NO:3, SEQ ID NO:22, SEQ
ID NO:41, SEQ ID NO:45, SEQ ID NO:47 or fragments
thereof, respectively.

As discussed in one of the Examples, rat mox1
protein (SEQ ID NO:21) is similar to mouse gp91 protein
(SEQ ID NO:38), whereas rat mox1B protein (SEQ ID NO:42)
is similar to human gp91 protein (SEQ ID NO:12). This
observation suggests that other isoforms of mouse and human
gp91 may exist. In addition, another subtype of human mox1,
similar to rat mox1B (SEQ ID NO:42), also exists. The
presence of two isoforms of rat mox1 protein in vascular
smooth muscle may have important physiological consequences
and biomedical applications. For example, the two isoforms
may have different biological activities, different tissue
distributions and may be regulated differently in physiological
and/or pathological conditions. The fact that mox1B (SEQ ID
NO:42) was isolated from cells exposed to angiotensin II,
known to promote oxidative stress and vascular growth,
suggests that it may be upregulated by this hormone and may be
overexpressed in disease. Therefore, the diagnostic kits of the
present invention can measure the relative expression of the two
mox isoforms. The diagnostic kits may also measure or detect
the relative expression of the mox proteins described herein
(i.e. human mox1 and/or human mox2) and duox proteins
described herein (i.e. human duox1 and/or human duox2).

Fragments of SEQ ID NO:1, SEQ ID NO:3, SEQ
ID NO:22, SEQ ID NO:41, SEQ ID NO:45 and SEQ ID NO:47
containing the relevant hybridizing sequence can be synthesized
onto the surface of a chip array. RNA samples, e.g., from
tumors, are then fluorescently tagged and hybridized onto the
chip for detection. This approach may be used diagnostically to
characterize tumor types and to tailor treatments and/or provide
prognostic information. Such prognostic information may have
predictive value concerning disease progression and life span,
and may also affect choice of therapy.

The present invention is further illustrated by the
following examples, which are not to be construed in any way
as imposing limitations upon the scope thereof. On the
contrary, it is to be clearly understood that resort may be had to
various other embodiments, modifications, and equivalents
thereof, which, after reading the description herein, may
suggest themselves to those skilled in the art without departing
from the spirit of the present invention.

EXAMPLE 1

Sequence Analysis and Cloning of the Human mox1 cDNA
(SEQ ID NO:1) Encoding for Production of the Human mox1
Protein p65mox (SEQ ID NO:2)
Using gp91phox as a query sequence, a 334 base
sequenced portion of expressed sequence tag (EST) 176696
(GenBank Accession number AA305700) showed 68.8%
sequence identity at the predicted amino acid level with human
(h) gp91phox. The bacterial strain number 129134 containing
the EST sequence in the pBluescript SK- vector was purchased
from American Tissue Type Culture Collection (ATCC,
Rockville, MD). The EST sequence was originally cloned from
a Caco-2 human colon carcinoma cell line. The EST176696
DNA was further sequenced using the T7 and T3 vector
promoters and primers designed to match the known 3'
sequence. Internal primers used for sequencing were as
follows: 5'-AAC AAG COT GGC TTC AGC ATG-3' SEQ ID
NO:5 (251S, numbering is based on the nucleotides from the 5'
end of EST176696, and S indicates the sense direction), 5'-AGC
AAT ATT GTT GGT CAT-3' SEQ ID NO:6 (336S), 5'-GAC
TTG ACA GAA AAT CTA TAA GGG-3' SEQ ID NO:7
(393S), 5'-TTG TAC CAG ATG GAT TTC AA-3' SEQ ID
NO:8 (673A, A indicates the antisense direction), 5'-CAG GTC
TGA AAC AGA AAA CCT-3' SEQ ID NO:9 (829S), 5'-ATG
AAT TCT CAT TAA TTA TTC AAT AAA-3' SEQ ID NO:10
(1455A). The coding sequence in EST176696 showed
homology to a 250 amino acid stretch corresponding to the N-
terminal 44% of human gp91phox, and contained a stop codon
corresponding to the location in human gp91phox. 5' Rapid
amplification of cDNA ends (RACE) was carried out using a
human colon cDNA library and Marathon cDNA Amplification
Kit (ClonTech) using 5'-ATC TCA AAA GAC TCT GCA CA-
3' SEQ ID NO:11 (41A) as an internal gene-specific primer
(Frohman et al. (1988) Proc. Natl. Acad. Sci. USA 85, 8998-
9002). 5' RACE resulted in a 1.1 kb fragment representing the
complete 5' sequence, based on homology with gp91phox.
Reamplification was performed with primers spanning the
putative start and stop codons, using the 1.1 kb 5' RACE
product and pSK-EST176696 for primer design. The amplified
1.7 kb fragment was TA cloned into the PCR2.1 vector
(Invitrogen, Carlsbad, CA). This recombinant vector is
referred to as PCR-mox.

Figure 1(a-d) presents a comparison of the present
amino acid sequences of human, bovine and murine gp91phox
with the human and rat mox1 proteins of the present invention
and the human duox2 protein of the present invention. Also
shown are the amino acid sequences for related plant enzyme
proteins.

The encoded hp65mox ("mox" referring to
mitogenic oxidase and "65" referring to its predicted molecular
weight) is listed as SEQ ID NO:2. h-gp91phox (SEQ ID
NO:12) and SEQ ID NO:2 differ in length by 3 residues and are
70% identical in their amino acid sequence. h-gp91phox and
SEQ ID NO:2 show a greater percentage identity in the C-
terminal half of the molecule which contains the putative
NADPH and FAD binding sites, and there are several relatively
long stretches of complete identity within this region.
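
A percentage identity of the kind quoted above can be computed from a pairwise alignment as sketched below. This is an illustration only: the text does not state which alignment program or denominator convention was used, so the convention here (identical residues divided by aligned, non-gap positions) is an assumption, and the two short sequences are hypothetical toy strings, not portions of SEQ ID NO:2 or SEQ ID NO:12.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity over aligned positions where neither sequence has a gap.

    Assumption: gap-containing columns are excluded from the denominator.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for res_a, res_b in zip(aligned_a, aligned_b):
        if res_a == gap or res_b == gap:
            continue  # skip columns containing a gap
        compared += 1
        if res_a == res_b:
            matches += 1
    if compared == 0:
        raise ValueError("no aligned residue pairs to compare")
    return 100.0 * matches / compared


if __name__ == "__main__":
    # Hypothetical toy alignment: 8 identical residues out of 9 compared.
    print(round(percent_identity("ACDEFGHIKL", "ACDQFGH-KL"), 1))
```

Applying the same function separately to the N-terminal and C-terminal halves of an alignment would show a regional difference in identity of the kind described for the C-terminal half.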

A dendrogram (Figure 2) comparing the amino
acid sequences of mouse and human gp91phox with that of
mox1 SEQ ID NO:2 shows that the latter probably represents a
distinct isoform of gp91phox. Two plant homologs of
cytochrome b558 large subunit are also indicated and represent
more distant relatives of the human sequences. Human mox1
(and rat mox1, described more fully below) lack asparagine-linked
glycosylation sites, which are seen in the highly glycosylated
human and mouse gp91phox. Additionally, the hydropathy
profiles of human gp91phox and mox1 are nearly identical and
include five very hydrophobic stretches in the amino-terminal
half of the molecules which are predicted to be membrane-
spanning regions.

EXAMPLE 2

Expression of Mox1

Human multiple tissue northern (MTN) Blot I and
Human MTN Blot IV (ClonTech) membranes were hybridized
with the putative coding region of the PCR-mox vector at 68°C
for several hours. The mox coding region was labeled by
random priming with [α-32P]dCTP (10 µCi) using the Prime-It
II kit (Stratagene). For analysis of mox1 expression in cell
lines, total RNA was prepared from 10^ cells using the High
Pure RNA Isolation Kit (Boehringer Mannheim) or RNeasy kit
(Qiagen). Total RNA (10-20 µg) was separated on a 1%
agarose formaldehyde mini-gel and transferred to a Nytran
filter (Biorad) and immobilized by ultraviolet cross-linking.

Northern blotting revealed that the major location
of mRNA coding for the mox1 protein was colon. The message
was also detected in prostate and uterus. The human colon-
carcinoma cell line, Caco-2, also expressed large quantities of
mox1 message. Northern blotting of mRNA from rat aortic
smooth muscle cells also showed strong hybridization, which
increased roughly two-fold within 12 hours after treatment with
platelet-derived growth factor. This increase in the expression
of rat mox1 is consistent with the idea that mox1 contributes to
the growth-stimulatory effects of PDGF.

EXAMPLE 3

Transfection of NIH 3T3 Cells with SEQ ID NO:1

The nucleotide sequence (SEQ ID NO:1) encoding
for production of the mox1 protein (SEQ ID NO:2) was
subcloned into the NotI site of the pEF-PAC vector (obtained
from Mary Dinauer, Indiana University Medical School,
Indianapolis, IN) which has a puromycin resistance gene.
Transfection was carried out as described in Sambrook et al.,
Molecular Cloning, A Laboratory Manual, Volumes 1-3, 2nd
edition, Cold Spring Harbor Laboratory Press, N.Y., 1989.
The SEQ ID NO:1 in pEF-PAC and the empty vector were
separately transfected into NIH 3T3 cells using Fugene 6
(Boehringer Mannheim). About 2 x 10^ cells maintained in
DMEM containing 10% calf serum were transfected with 10 ng
of DNA. After 2 days, cells were split and selected in the same
medium containing 1 mg/ml puromycin. Colonies that survived
in selection media for 10 to 14 days were subcultured
continuously in the presence of puromycin.

Transfected cells exhibited a "transformed"-like
morphology, similar to that seen with (V12)Ras-transfected
cells, characterized by long spindle-like cells. The parent NIH
3T3 cells or cells transfected with the empty vector showed a
normal fibroblast-like morphology.

EXAMPLE 4 

Expression of Mox1 (SEQ ID NO:1) in Transfected NIH 3T3
Cells

To verify the expression of mox1 mRNA after
transfection, RT-PCR and Northern blotting were performed.
Total RNAs were prepared from 10^6 cells using the High Pure
RNA Isolation Kit (Boehringer Mannheim) or RNeasy kit
(Qiagen). cDNAs for each colony were prepared from 1-2 ng
of total RNA using Advantage RT-PCR Kit (ClonTech). PCR
amplification was performed using primers, 5'-TTG GCT AAA
TCC CAT CCA-3' SEQ ID NO:13 (NN459S, numbering
containing NN indicates numbering from the start codon of
mox1) and 5'-TGC ATG ACC AAC AAT ATT GCT G-3' SEQ
ID NO:14 (NN1435A). For Northern blotting, 10-20 ng of
total RNA was separated on a 1% agarose formaldehyde gel and
transferred to a nylon filter. After ultraviolet (UV) cross-
linking, filters were used for Northern blotting assay as
described in Example 2.

Colonies expressing large amounts of mox1 mRNA
were chosen for further analysis. The expression of mRNA for 
glyceraldehyde 3 phosphate dehydrogenase in the various cell 
lines was normal. 

EXAMPLE 5

Colony Formation on Soft Agar

10^5 to 10^3 cells stably transfected with human
mox1 gene SEQ ID NO:1 and with empty vector were prepared
in 0.3% warm (40°C) agar solution containing DMEM and 10%
calf serum. Cells were distributed onto a hardened 0.6% agar
plate prepared with DMEM and 10% calf serum. After three
weeks in culture (37°C, 5% CO2) colony formation was
observed by microscopy.

Cells which were stably transfected with the empty 
vector and cultured in soft agar for 3 weeks as above did not 
display anchorage independent growth. In contrast, NIH 3T3 
cells which had been stably transfected with the mox1 (SEQ ID
NO:1) and cultured for 3 weeks in soft agar demonstrated
anchorage independent growth of colonies. 

EXAMPLE 6

NADPH-Dependent Superoxide Generation Assay

In one embodiment of the present invention, NIH
3T3 cells stably transfected with the human mox1 gene (SEQ ID
NO:1) were analyzed for superoxide generation using the
lucigenin (bis-N-methylacridinium) luminescence assay (Sigma,
St. Louis, MO; Li et al. (1998) J. Biol. Chem. 273, 2015-2023).
Cells were washed with cold HANKS' solution and
homogenized on ice in HANKS' buffer containing 15% sucrose
using a Dounce homogenizer. Cell lysates were frozen
immediately in a dry ice/ethanol bath. For the assay, 30 µg of
cell lysate was mixed with 200 µM NADPH and 500 µM
lucigenin. Luminescence was monitored using a LumiCounter
(Packard) at three successive one minute intervals and the
highest value was used for comparison. Protein concentration
was determined by the Bradford method.

Superoxide generation was monitored in lysates
from some of the stably transfected cell lines and was compared
with superoxide generation by the untransfected NIH 3T3 cell
lysates. The results are shown in Table 4. Cell lines 26, 27,
and 28 gave the highest degree of morphological changes by
microscopic examination corresponding to the highest degree of
superoxide generation. The luminescent signal was inhibited by
superoxide dismutase and the general flavoprotein inhibitor
diphenylene iodonium, but was unaffected by added
recombinant human p47phox, p67phox and Rac1(GTP-γS),
which are essential cytosolic factors for the phagocyte
respiratory-burst oxidase.

Table 4

Cell Line Name             Superoxide Generation (RLU)

Control (untransfected)    6045
mox1-26                    17027
mox1-27                    14670
mox1-28                    18411
mox1-65                    5431
mox1-615                   11331
mox1-+3                    8645
mox1-4-10                  5425
mox1-pcc16                 8050
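
To make the comparison with the untransfected control explicit, the Table 4 readings can be expressed as fold change over control. The sketch below is illustrative only; it uses the RLU values listed in Table 4 for a subset of the cell lines, and the dictionary and function names are hypothetical.

```python
# RLU values copied from Table 4 (subset of the listed cell lines).
CONTROL_RLU = 6045
TABLE_4_RLU = {
    "mox1-26": 17027,
    "mox1-27": 14670,
    "mox1-28": 18411,
    "mox1-65": 5431,
    "mox1-615": 11331,
}


def fold_over_control(rlu, control=CONTROL_RLU):
    """Ratio of a cell line's luminescence to the untransfected control."""
    if control <= 0:
        raise ValueError("control must be positive")
    return rlu / control


def rank_by_fold(table=TABLE_4_RLU, control=CONTROL_RLU):
    """Return (name, fold) pairs sorted from highest to lowest fold change."""
    folds = {name: fold_over_control(rlu, control) for name, rlu in table.items()}
    return sorted(folds.items(), key=lambda item: item[1], reverse=True)


if __name__ == "__main__":
    for name, fold in rank_by_fold():
        print(f"{name}: {fold:.2f}-fold over control")
```

On these figures, lines 26, 27 and 28 give the three highest ratios, in agreement with the statement that they showed the highest degree of superoxide generation.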

In an alternate and preferred embodiment of the present
invention, cells that had been stably transfected with mox1
(YA28) or with empty vector (NEF2) were grown in 10 cm
tissue culture plates in medium containing DMEM, 10% calf
serum, 100 units/ml penicillin, 100 µg/ml streptomycin, and 1
µg/ml puromycin to approximately 80% confluency. Cells (five
tissue culture plates of each cell type) were washed briefly with
5 ml phosphate buffered saline (PBS) then dissociated from the
plates with PBS containing 5 mM EDTA. Cells were pelleted
by centrifuging briefly at 1000 x g.

To permeabilize the cells, freeze thaw lysis was carried
out and this was followed by passage of the cell material
through a small bore needle. The supernatant was removed and
the cells were frozen on dry ice for 15 minutes. After cells
were thawed, 200 µl lysis buffer (HANKS' Buffered Salt
Solution - HBSS) containing a mixture of protease inhibitors
from Sigma (Catalog # P2714) was added. Cells on ice were
passed through an 18 gauge needle 10 times and 200 µl of HBSS
buffer containing 34% sucrose was added to yield a final
concentration of 17% sucrose. Sucrose appeared to enhance
stability upon storage. The combination of freeze-thawing and
passage through a needle results in lysis of essentially all of the
cells, and this material is referred to as the "cell lysate."

The cell lysates were assayed for protein concentration
using the BioRad protein assay system. Cell lysates were
assayed for NADPH-dependent chemiluminescence by
combining HBSS buffer, arachidonic acid, and 0.01 - 1 µg
protein in assay plates (96 well plastic plates). The reaction was
initiated by adding 1.5 mM NADPH and 75 µM lucigenin to the
assay mix to give a final concentration of 200 µM NADPH and
10 µM lucigenin, and the chemiluminescence was monitored
immediately. The final assay volume was 150 µl. The optimal
arachidonic acid concentration was between 50-100 µM. A
Packard Lumicount luminometer was used to measure
chemiluminescence of the reaction between lucigenin and
superoxide at 37°C. The plate was monitored continuously for
60 minutes and the maximal relative luminescence unit (RLU)
value for each sample was used for the graph.
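
The dilutions quoted in this Example can be checked with the usual C1V1 = C2V2 relationship. The following is a worked sketch only; the function names are hypothetical, and the reading that the NADPH and lucigenin stocks are added together as one starting mix is an inference from the matching volumes, not a statement in the text.

```python
def stock_volume_needed(stock_conc, final_conc, final_volume):
    """Volume of stock required to reach final_conc in final_volume (C1V1 = C2V2).

    Concentrations must share one unit; the result is in the unit of final_volume.
    """
    if stock_conc <= 0:
        raise ValueError("stock_conc must be positive")
    return final_conc * final_volume / stock_conc


def mixed_concentration(conc_a, vol_a, conc_b, vol_b):
    """Concentration after combining two solutions of the same solute."""
    return (conc_a * vol_a + conc_b * vol_b) / (vol_a + vol_b)


if __name__ == "__main__":
    # 1.5 mM (1500 uM) NADPH stock diluted to 200 uM in a 150 ul assay.
    print(stock_volume_needed(1500, 200, 150))  # ul of stock
    # 75 uM lucigenin stock diluted to 10 uM in a 150 ul assay.
    print(stock_volume_needed(75, 10, 150))  # ul of stock
    # 200 ul lysate in sucrose-free buffer plus 200 ul of 34% sucrose buffer.
    print(mixed_concentration(0, 200, 34, 200))  # percent sucrose
```

Both stocks work out to the same added volume (20 µl in a 150 µl assay), which is consistent with a single starting mix containing both reagents, and the sucrose calculation reproduces the stated final concentration of 17%.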

Figure 3 shows the RLU at various concentrations of cell
lysates from mox1-transfected (YA28) and vector control
(NEF2) cells. The presence of NaCl or KCl within a
concentration range of 50-150 µM is important for optimal
activity. MgCl2 (1-5 mM) further enhanced activity by about 2-
fold. This cell-free assay for mox1 NADPH-oxidase activity is
useful for screening modulators (inhibitors or stimulators) of
the mox1 enzyme. The assay may also be used to detect mox
and duox NADPH-oxidase activity in general and to screen for
modulators (inhibitors or stimulators) of the mox and duox
family of enzymes.

EXAMPLE 7

Nitro Blue Tetrazolium Reduction by Superoxide Generated by
NIH 3T3 Cells Transfected with the Mox1 cDNA (SEQ ID
NO:1)

Superoxide generation by intact cells was monitored by using
superoxide dismutase-sensitive reduction of nitroblue
tetrazolium. NEF2 (vector alone control), YA26 (mox1 (SEQ ID
NO:1)-transfected) and YA28 (mox1 (SEQ ID NO:1)-transfected)
cells were plated in six well plates at 500,000 cells per well.
About 24 hours later, medium was removed from cells and the
cells were washed once with 1 mL Hanks solution (Sigma, St.
Louis, MO). About 1 mL of filtered 0.25% nitroblue tetrazolium
(NBT, Sigma) was added in Hanks without or with 600 units of
superoxide dismutase (Sigma) and cells were incubated at 37°C
in the presence of 5% CO2. After 8 minutes the cells were
scraped and pelleted at more than 10,000g. The pellet was
re-suspended in 1 mL of pyridine (Sigma) and heated for 10
minutes at 100°C to solubilize the reduced NBT. The
concentration of reduced NBT was determined by measuring the
absorbance at 510 nm, using an extinction coefficient of
11,000 M^-1 cm^-1. Some wells were untreated and used to
determine cell number.
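The conversion from absorbance to amount of reduced NBT follows the Beer-Lambert law. The sketch below illustrates it under stated assumptions: a 1 cm path length (the text gives only the extinction coefficient), the 1 mL pyridine volume from the protocol, and a purely hypothetical absorbance reading. The function name is illustrative.

```python
def reduced_nbt_nmol(absorbance, volume_ml=1.0, epsilon=11000.0, path_cm=1.0):
    """Amount of reduced NBT in nmol from the absorbance at 510 nm.

    Beer-Lambert: c (mol/L) = A / (epsilon * path length).
    The 1 cm path length is an assumption, not stated in the text.
    """
    conc_molar = absorbance / (epsilon * path_cm)
    moles = conc_molar * (volume_ml / 1000.0)  # mL -> L
    return moles * 1e9  # mol -> nmol


# Hypothetical reading: A510 = 0.11 in the 1 mL pyridine extract.
print(reduced_nbt_nmol(0.11))
```

Dividing the result by the cell count of a parallel untreated well, in units of 10^6 cells, gives the nmol per 10^6 cells figure used in Table 5.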

The data are presented in Table 5 and Figure 4 and
indicate that the mox1 (SEQ ID NO:1)-transfected cells
generated significant quantities of superoxide.

Table 5

NBT Reduction (nmols/10^6 cells)

                          - SOD         + SOD
vector control cells    2.5 ± 0.5     2.1 ± 0.5
YA26 (mox1) cells       6.4 ± 0.2     3.4 ± 0.1
YA28 (mox1) cells       5.2 ± 0.6     3.4 ± 0.3

-SOD and +SOD mean in the absence or presence of added
superoxide dismutase, respectively.

Because superoxide dismutase is not likely to penetrate cells,
the superoxide dismutase-sensitive portion of the NBT reduction
must reflect superoxide generated extracellularly. The amount of
superoxide generated by these cells is about 5-10% of that
generated by activated human neutrophils.
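The SOD-sensitive portion of NBT reduction is the difference between the -SOD and +SOD columns of Table 5. A short sketch of that subtraction, using the mean values from the table (the error terms are ignored here, and the helper name is illustrative):

```python
# Mean NBT reduction (nmol per 10^6 cells) from Table 5: (-SOD, +SOD).
table5 = {
    "vector control": (2.5, 2.1),
    "YA26 (mox1)": (6.4, 3.4),
    "YA28 (mox1)": (5.2, 3.4),
}


def sod_sensitive(minus_sod, plus_sod):
    """SOD-inhibitable NBT reduction, rounded to the table's precision."""
    return round(minus_sod - plus_sod, 1)


for name, (minus_sod, plus_sod) in table5.items():
    print(name, sod_sensitive(minus_sod, plus_sod))
```

On these means the SOD-sensitive signal is 0.4 for the vector control against 3.0 and 1.8 for the two mox1 lines.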

EXAMPLE 8 

Modification of Intracellular Components in Mox1-Transfected
Cells

To test whether superoxide generated by mox1 can affect
intracellular "targets," aconitase activity in control and
mox-transfected cell lines was monitored as described in Suh et
al. (1999) Nature 401, 79-82. Aconitase contains a
four-iron-sulphur cluster that is highly susceptible to
modification by superoxide, resulting in a loss of activity, and
has been used as a reporter of intracellular superoxide
generation. Aconitase activity was determined as described in
Gardner et al. (1995) J. Biol. Chem. 270, 13399-13405. Aconitase
activity was significantly diminished in all three
mox-transfected cell lines designated YA26, YA28 and YA212 as
compared to the transfected control (Figure 5). Approximately
50% of the aconitase in these cells is mitochondrial, based on
differential centrifugation, and the cytosolic and mitochondrial
forms were both affected. Control cytosolic and mitochondrial
enzymes that do not contain iron-sulfur centres were not
affected. Superoxide generated in mox1-transfected cells is
therefore capable of reacting with and modifying intracellular
components.

EXAMPLE 9 

Tumor Generation in Nude Mice Receiving Cells Transfected
with the Human mox1 cDNA (SEQ ID NO:1)

About 2 x 10^6 NIH 3T3 cells (either mox1-transfected with
SEQ ID NO:1 or cells transfected using empty vector) were
injected subdermally into the lateral aspect of the neck of 4-5
week old nude mice. Three to six mice were injected for each of
three mox1-transfected cell lines, and 3 mice were injected with
the cells transfected with empty vector (control). After 2 to 3
weeks, mice were sacrificed. The tumors were fixed in 10%
formalin and characterized by histological analysis. Tumors
averaged 1.5 x 1 x 1 cm in size and showed histology typical of
sarcoma type tumors. In addition, tumors appeared to be highly
vascularized with superficial capillaries. Eleven of twelve mice
injected with mox1 gene-transfected cells developed tumors,
while none of the three control animals developed tumors.

In another study, 15 mice were injected with
mox1-transfected NIH 3T3 cells. Of the 15 mice injected, 14
showed large tumors within 17 days of injection, and tumors
showed expression of mox1 mRNA. Histologically, the tumors
resembled fibrosarcomas and were similar to ras-induced
tumors. Thus, ras and mox1 were similarly potent in their
ability to induce tumorigenicity of NIH 3T3 cells in athymic
mice.

EXAMPLE 10 

Demonstration of the Role of Mox1 in Non-Cancerous Growth

A role in normal growth was demonstrated in rat aortic
vascular smooth-muscle cells by using antisense to rat mox1.
Transfection with the antisense DNA resulted in a decrease in
both superoxide generation and serum-dependent growth. Mox1 is
therefore implicated in normal growth in this cell type.

EXAMPLE 11

Expression of Human Mox1 Protein (SEQ ID NO:2) in a
Baculovirus Expression System

SEQ ID NO:2 was also expressed in insect cells using
recombinant baculovirus. To establish the p65mox1 expressing
virus system, the mox1 gene (SEQ ID NO:1) was initially cloned
into the pBacPAK8 vector (Clontech, Palo Alto, CA) and
recombinant baculovirus was constructed using standard methods
according to manufacturer's protocols. Briefly, PCR amplified
mox1 DNA was cloned into the KpnI and EcoRI site of the vector.
Primers used for PCR amplification were: 5'-CAA GOT AGG TGT TGA
GGA TOG GAA ACT-3', SEQ ID NO:15, and 5'-ACG AAT TCA AGT AAA
TTA CTG AAG ATA C-3', SEQ ID NO:16. Sf9 insect cells (2 x 10^6
cells) were infected with 0.5 mg of linearized baculovirus DNA
sold under the trademark BACULOGOLD® (PharMingen, San Diego,
CA) and 5 mg pBacPAK8-p65mox1 using Transfection Buffers A and B
(PharMingen, San Diego, CA). After 5 days, the supernatants
containing recombinant viruses were harvested and amplified by
infecting fresh Sf9 cells for 7 days. Amplification was carried
out three times and the presence of the recombinant virus
containing mox1 DNA was confirmed by PCR using the same primers.
After three rounds of amplification, plaque purification was
carried out to obtain high titer viruses. Approximately 2 x 10^
Sf9 cells in agar plates were infected for 5 days with serial
dilutions of virus and were dyed with neutral red for easy
detection of virus plaques. Selected virus plaques were
extracted and the presence of the human mox1 DNA was confirmed
again by PCR.

EXAMPLE 12 

Cloning of a Rat Homolog of p65mox (SEQ ID NO:2)

cDNA clones of p65mox from a rat aortic smooth muscle cell
have been obtained. RT-PCR (reverse transcription polymerase
chain reaction) was carried out as follows: first strand cDNA
synthesis was performed using total RNA from rat aortic vascular
smooth muscle cells, oligo dT primer and SuperScript II reverse
transcriptase, and followed by incubation with RNase H.
Degenerate PCR primers were designed to anneal to conserved
areas in the coding regions of h-mox1 and gp91phox of human
(X04011), mouse (U43384) and porcine (SSU02476) origin. Primers
were: sense 5'-CCIGTITGTCGIAATCTGCTSTCCTT-3', SEQ ID NO:17 and
antisense 5'-TCCCIGCAIAICCAGTAGAARTAGATCTT-3', SEQ ID NO:18. A
major PCR product of the expected 1.1 kb size was purified by
agarose electrophoresis and used as template in a second PCR
amplification reaction.
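The degenerate primers above use ambiguity symbols (I, S, R) in addition to the four bases. As an illustration of how such a primer can be checked against a candidate target, the sketch below converts the sense primer to a regular expression. It assumes the usual IUPAC meanings S = C or G and R = A or G, and it treats I (inosine) as matching any base, which is a simplification; the helper name and the test sequence are hypothetical.

```python
import re

# Ambiguity symbols used in the primers above. Treating inosine ("I")
# as a wildcard for any base is a simplifying assumption.
CODES = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "S": "[CG]", "R": "[AG]", "I": "[ACGT]",
}


def primer_to_regex(primer):
    """Compile a degenerate primer into a regex over A/C/G/T."""
    return re.compile("".join(CODES[base] for base in primer))


sense = primer_to_regex("CCIGTITGTCGIAATCTGCTSTCCTT")

# Hypothetical target: every I replaced by A, and S replaced by C.
candidate = "CCAGTATGTCGAAATCTGCTCTCCTT"
print(sense.fullmatch(candidate) is not None)
```

A real annealing check would also allow mismatches and consider the reverse complement for the antisense primer; this sketch covers exact matching only.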

An aliquot of the RT-PCR product was blunt-ended, ligated
into a modified Litmus 29 vector and used to transform XL10
competent E. coli. Approximately 120 bacterial colonies were
screened for the presence of a full-length insert by direct PCR
using vector primers and Taq polymerase. Plasmids were purified
from 25 positive colonies and mapped by digestion with BamHI.
Representative plasmids from each digestion pattern were
partially sequenced. Five out of 25 clones contained
non-specific amplification products and 20 contained identical
inserts similar to human (h)-mox1. One of the latter clones was
fully sequenced and found to be 83% identical to h-mox1 over
1060 nucleotides. A 1.1 kb probe was generated by PCR
amplification of the insert of a rat mox1 clone with the
degenerate primers described above and used to hybridize to a
Northern blot of rat vascular smooth muscle cell RNA. A single
band, migrating between 28S rRNA and 18S rRNA, indicated the
presence of a message with a size compatible with that of human
mox-1 (2.6 kb).

To obtain full-length rat mox1, 3' and 5' rapid
amplification of cDNA ends (RACE) reactions were performed as
described above, using the gene-specific primers
5'-TTGGGAGAGTGAGTGAGGATGTGTTG-3', SEQ ID NO:19 and
5'-CTGTTGGCTTCTACTGTAGCGTTCAAAGTT-3', SEQ ID NO:20 for 3' and
5' RACE, respectively. Single major 1.5 kb and 850 bp products
were obtained for 3' and 5' RACE, respectively. These products
were purified by agarose gel electrophoresis and reamplified
with Taq polymerase. Both products were cloned into the pCR 2.1
vector and used to transform electrocompetent XL1 blue E. coli.
The RACE products were sequenced and new terminal primers were
designed: sense 5'-TTCTGAGTAGGTGTGCATTTGAGTGTCATAAAGAC-3'
(SEQ ID NO:43), and antisense
5'-TTTTCCGTCAAAATTATAACTTTTTATTTTCTTTTTATAACACAT-3'
(SEQ ID NO:44). PCR amplification of rat VSMC cDNA was
performed using these primers.

A single 2.6 kb product was obtained, ligated into pCR 2.1
and used to transform electrocompetent XL1 blue E. coli. The
insert was sequenced with 12 sense and 14 antisense primers. Its
length is 2577 bp (including primer sequences), comprising a
1692 bp open reading frame, 127 bp 5' and 758 bp 3' untranslated
regions. The presence of six in-frame stop codons in the 5'
untranslated region suggests that the full length coding region
has been obtained. Consensus polyadenylation sequences are
present at nucleotides 2201 and 2550. Conceptual translation
yields a 563 amino acid peptide, one residue shorter than the
human deduced sequence. This new amino acid sequence is more
similar to human mox1 SEQ ID NO:3 (82% identity) than to mouse
gp91phox SEQ ID NO:38 (55% identity), suggesting that it is
indeed rat mox1 (SEQ ID NO:21). This rat (r) homolog of p65mox
protein is called r-p65mox or p65mox/rat.pep and is shown as
SEQ ID NO:21. The nucleotide sequence encoding for r-p65mox is
shown as SEQ ID NO:22.
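The lengths reported for the rat mox1 insert are mutually consistent, as a short arithmetic check shows. The sketch below assumes the 1692 bp open reading frame includes the stop codon, which is the reading under which it yields 563 residues; the variable names are illustrative.

```python
# Figures quoted in the text for the rat mox1 cDNA insert.
utr5_bp, orf_bp, utr3_bp = 127, 1692, 758

total_bp = utr5_bp + orf_bp + utr3_bp  # full insert length
codons = orf_bp // 3                   # codons in the open reading frame
residues = codons - 1                  # assumes the ORF count includes the stop codon

print(total_bp, codons, residues)
```

The sum reproduces the stated 2577 bp insert, and the residue count matches the stated 563 amino acid peptide.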

Example 13

Expression of rat (r)-p65mox mRNA in Vascular Smooth
Muscle and Induction by Angiotensin II, Platelet-Derived
Growth Factor (PDGF), and Phorbol Myristic Acid (PMA)

Using the partial cDNA clone from rat, we have examined
cultured rat aortic smooth muscle cells for expression of
message for r-p65mox. We have observed the mRNA for r-p65mox in
these cells. It has been previously reported (Griendling et al.
(1994) Circ. Res. 74, 1141-1148; Fukui et al. (1997) Circ. Res.
80, 45-51; Ushio-Fukai et al. (1996) J. Biol. Chem. 271,
23317-23321) that in vitro or in vivo treatment with angiotensin
II (AII) is a growth stimulus for vascular smooth muscle cells,
and that AII induces increased superoxide generation in these
cells. Platelet-derived growth factor (PDGF) and PMA are
proliferative signals for vascular smooth muscle cells. We
observed that the mRNA for r-p65mox was induced approximately
2-3 fold by angiotensin II (100 nM), corresponding to the
increased level of superoxide generation. Thus, the increased
superoxide generation in these cells correlates with increased
expression of the mRNA for this enzyme. The mRNA for r-p65mox
also increased 2 or more fold in response to the growth stimulus
PDGF (20 ng/ml), and 2-3 fold in response to PMA. Quantitation
by densitometry revealed that rat mox1 message was induced
nearly 4-fold at the 6 and 12 hour time points in response to
PDGF, and about 2-fold at the 12 hour time point in response to
AII. 28S RNA was used as a control for RNA recovery.
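Fold induction from densitometry is conventionally computed by normalizing each target band to its recovery control (here 28S RNA) before taking the ratio of treated to untreated. The sketch below shows that calculation; the text does not give the exact normalization used or any band intensities, so both the formula and the numbers are assumptions for illustration only.

```python
def fold_induction(target_treated, control_treated,
                   target_untreated, control_untreated):
    """Ratio of control-normalized target signal, treated over untreated."""
    treated = target_treated / control_treated
    untreated = target_untreated / control_untreated
    return treated / untreated


# Hypothetical band intensities (arbitrary densitometry units).
print(fold_induction(target_treated=800.0, control_treated=1000.0,
                     target_untreated=250.0, control_untreated=1250.0))
```

With these made-up inputs the normalized signal rises about 4-fold; without the 28S correction the raw ratio (800/250) would understate recovery differences between lanes.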

EXAMPLE 14 

Antibodies to Fragments of Human (h)-p65mox (SEQ ID NO:2)

Polyclonal antibodies were raised in rabbits against the
C-terminal half of h-p65mox (residues 233 through 564, SEQ ID
NO:23) which is predicted to fold into a cytosolic domain
containing FAD and the NADPH or NADH binding site. This domain
was expressed in E. coli as an N-terminal GST-fusion protein and
was purified on glutathione agarose by standard methods. Two
antipeptide antibodies were also made against h-p65mox (residues
243-256, referred to as Peptide A, SEQ ID NO:24) and h-p65mox
(residues 538-551, referred to as Peptide B, SEQ ID NO:25).
Peptides were conjugated to keyhole limpet hemocyanin (KLH)
using glutaraldehyde.

Antigens were injected into different rabbits initially in
complete Freund's adjuvant, and were boosted 4 times with
antigen in incomplete Freund's adjuvant at intervals of every
three weeks. Approximately 0.5 mg to 1 mg of peptide was
administered at each injection. Blood was drawn 1 week after
each boost and a terminal bleed was carried out 2 weeks after
the final boost. Antibodies to Peptide A and Peptide B were
affinity purified by column chromatography through peptide A or
peptide B conjugated to Affigel 15 (Bio-Rad, Richmond, CA). 10
mg of peptide was covalently crosslinked to 2 ml of Affigel 15
resin and the gel was washed with 20 ml of binding buffer (20 mM
Hepes/NaOH, pH 7.0, 200 mM NaCl, and 0.5% Triton X-100). The
remaining functional N-hydroxysuccinimide was blocked with 100
µl of 1 M ethanolamine. After washing with 20 ml of binding
buffer, 5 ml of the antiserum was incubated with the peptide
A-conjugated Affigel 15 resin overnight at 4°C. Unbound protein
was washed away with 20 ml of binding buffer. Elution of the
antibodies from the gel was performed with 6 ml of elution
buffer (100 mM glycine/HCl, pH 2.5, 200 mM NaCl, and 0.5% Triton
X-100). The eluate was then neutralized by adding 0.9 ml of 1 M
Tris/HCl, pH 8.0. The GST-fusion form of truncated p65mox1
protein (residues 233-566, SEQ ID NO:23) was expressed in E.
coli. Samples (20 µg each) were run on 12% SDS-PAGE either
before or 1 or 4 hours after induction with 100 µM IPTG
(isopropyl β-thiogalactoside).

The extracted proteins were subjected to immunoprobing with
affinity purified antiserum to peptide A at a 1:1000 dilution.
The detection of antigens was performed using an enhanced
chemiluminescence kit (Amersham, Buckinghamshire, UK). The
affinity purified antibody to mox1 (243-256, SEQ ID NO:24) was
used at a dilution of 1:1000 in a Western blot in which a total
of 10 µg of protein was added to each lane. The major band
observed at 4 hours after IPTG induction corresponded to the
size of the GST-mox1 expressed in bacteria containing the
pGEX-2T vector encoding the GST-mox1 fusion protein.

Example 15 

Presence of an NAD(P)H Oxidase in Ras-Transformed 
Fibroblasts 

A superoxide-generating NADPH oxidase activity was detected
in homogenates from NIH 3T3 cells, and this activity increased
about 10-15 fold in Ras-transformed NIH 3T3 cells (Table 6). To
establish the stable Ras-transformed cell lines, the DNA for
human Ras encoding an activating mutation at amino acid number
12 (Valine, referred to as V12-Ras) was subcloned into BamHI and
EcoRI sites of pCDNA3 vector which has a neomycin resistance
gene. V12-Ras in pCDNA3 and empty vector were transfected into
NIH 3T3 cells using Lipofectamine Plus (Gibco). 2 x 10^ cells
were maintained with DMEM containing 10% calf serum and
transfected with 1 mg of DNA. After 2 days, cells were split and
selected with the same medium but containing 1 mg/ml neomycin.
Colonies surviving in selection media for 10 to 14 days were
sub-cultured and characterized by immunoblot analysis using
antibody against human H-Ras.

The expression of Ras in cells transfected with 
pcDNA-3 vector alone or in three cell lines transfected with 
V12-Ras in the same vector was analyzed on a Western blot. 
The three cell lines were named V12-Ras-7, V12-Ras-4, and 
V12-Ras-8. The expression of V12-Ras varied widely among 
the three cell lines tested. The V12-Ras-4 cell line expressed the
highest level of Ras followed by the V12-Ras-8 cell line. The 
V12-Ras-7 cell line expressed the lowest level of Ras. 

Lysates from each of these lines were then prepared 
and tested for their ability to generate superoxide. For each cell 
line, cells were washed with cold HANKS' balanced salt solution 
(HBSS), collected by centrifugation, kept on dry-ice for more 
than 30 min, and disrupted by suspending in low salt buffer 
(LSB; 50 mM Tris/HCl, pH 7.5, 1 mM PMSF, and protease 
cocktail from Sigma) and passing through a syringe needle (18 
gauge) ten times. Cell lysates were frozen in dry-ice 
immediately after determining the protein concentration. 

Table 6 shows superoxide generation in the transfected cells
measured using the lucigenin luminescence assay. For the assay,
5 µg of cell lysates were incubated with the reaction mixture
containing 10 µM lucigenin (luminescent probe) and 100 µM NADPH
(substrate) in the presence or absence of 100 µM arachidonate in
the absence or presence of 100 U of superoxide dismutase (SOD)
or 1 µM diphenyleneiodonium (DPI). Luminescence of the reaction
mixture was measured for 0.5 second with a LumiCounter
(Packard), four times at 3 second intervals. RLU in Table 6
refers to relative luminescence units.

As shown in Table 6, the luminescence was partially
inhibited by superoxide dismutase indicating that the signal was
due at least in part to the generation of superoxide. DPI, a
known inhibitor of both neutrophil and non-neutrophil NADPH
oxidase activities, completely inhibited activity. The
generation of superoxide correlated with the expression of Ras
in the three cell lines. Thus, oncogenic Ras appears to induce an
NADPH-dependent superoxide generating activity that is similar
to the activity catalyzed by p65mox1.



Table 6

                             RLU/5 µg protein

                       no additions    plus SOD    plus DPI
Vector Control (1)          465           154          48
V12-Ras-7 (2)              1680           578          39
V12-Ras-4 (3)              5975          2128          36
V12-Ras-8 (4)              4883          2000          35
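The comparisons drawn from Table 6 (fold increase over the vector control, and inhibition by SOD and DPI) can be reproduced with a few lines of arithmetic. The sketch below reads each cell line's values as (no additions, plus SOD, plus DPI); that column assignment is an interpretation of the table as printed, and the function and key names are illustrative.

```python
# RLU per 5 ug protein, read as (no additions, plus SOD, plus DPI).
# The column assignment is an assumption about the table layout.
table6 = {
    "Vector Control": (465, 154, 48),
    "V12-Ras-7": (1680, 578, 39),
    "V12-Ras-4": (5975, 2128, 36),
    "V12-Ras-8": (4883, 2000, 35),
}


def summarize(rows, baseline="Vector Control"):
    """Fold change over baseline and percent inhibition by SOD and DPI."""
    base = rows[baseline][0]
    summary = {}
    for name, (none, sod, dpi) in rows.items():
        summary[name] = {
            "fold_vs_vector": none / base,
            "sod_inhibition_pct": 100.0 * (none - sod) / none,
            "dpi_inhibition_pct": 100.0 * (none - dpi) / none,
        }
    return summary


for name, stats in summarize(table6).items():
    print(name, {key: round(value, 1) for key, value in stats.items()})
```

On this reading the two higher-expressing lines (V12-Ras-4 and V12-Ras-8) show roughly 13-fold and 10.5-fold increases over the vector control, while V12-Ras-7 shows about 3.6-fold, in line with the reported ranking of Ras expression.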



Example 16 

Molecular Cloning of Another Rat mox1 cDNA Called Rat
mox1B

A rat cDNA library was screened in an effort to identify new
rat mox sequences. The library was constructed in a ZAP express
lambda phage vector (Stratagene, La Jolla, CA) using RNA
isolated from rat vascular smooth muscle cells which had been
exposed to 100 nM angiotensin II for 4 hours. The library was
screened using standard blot hybridization techniques with the
rat mox1 probe described previously. Fifteen individual clones
were obtained that were characterized by PCR and restriction
mapping. Two different types of clones were thus identified and
representatives of each type were sequenced. A clone of the
first type (representative of 13) was found to be similar to the
previously identified rat mox1 and was thus named rat mox1B.
Clones of the second type (representative of 2) were incomplete
rat mox sequences.

The length of the rat mox1B nucleotide sequence is 2619 bp
and is listed as SEQ ID NO:41. The single longest 1497 bp open
reading frame encompasses nucleotides 362 to 1858. The presence
of two in-frame stop codons in the 5' untranslated region at
nucleotides 74 and 257 indicates that the full-length coding
region has been isolated. Two putative polyadenylation sites are
present at positions 2243 and 2592. Alignment of the rat mox1
nucleotide sequence (SEQ ID NO:22) and the rat mox1B nucleotide
sequence (SEQ ID NO:41) shows that the two nucleotide sequences
are identical except at their 5' ends, suggesting that they may
represent two alternatively spliced messages from the same gene.
Sequence identity starts at nucleotides 269 and 311, for rat
mox1 and rat mox1B, respectively.
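The coordinates reported for rat mox1 and rat mox1B can be cross-checked against each other: if the two sequences differ only at their 5' ends, every shared feature should be shifted by the same number of nucleotides. The sketch below performs that check using only figures quoted in the text; the dictionary layout and names are illustrative.

```python
# Coordinates quoted in the text for the two rat cDNAs.
mox1 = {"length": 2577, "identity_start": 269, "polyA": (2201, 2550)}
mox1b = {"length": 2619, "identity_start": 311, "polyA": (2243, 2592)}

# Offsets of shared features between the two sequences.
offset = mox1b["identity_start"] - mox1["identity_start"]
length_diff = mox1b["length"] - mox1["length"]
polya_shifts = [b - a for a, b in zip(mox1["polyA"], mox1b["polyA"])]

# Length of the mox1B open reading frame from its inclusive coordinates.
orf_length = 1858 - 362 + 1

print(offset, length_diff, polya_shifts, orf_length)
```

All three comparisons give the same 42-nucleotide shift, and the open reading frame coordinates reproduce the stated 1497 bp, which supports the reading that the two messages are identical downstream of their 5' ends.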

Conceptual translation of the rat mox1B nucleotide sequence
(SEQ ID NO:41) yields a 499 amino acid sequence with a predicted
molecular weight of 58 kDa. This amino acid sequence for rat
mox1B protein is shown in SEQ ID NO:42. Alignment of the deduced
amino acid sequences for rat mox1 (SEQ ID NO:21) and rat mox1B
(SEQ ID NO:42) indicates that rat mox1B is identical to rat
mox1A, except for a missing stretch of 64 residues at the
N-terminus. Therefore, rat mox1B appears to be a splicing
variant derived from the same gene as rat mox1.

EXAMPLE 17 

Sequence Analysis and Cloning of the Human Mox2 cDNA
(SEQ ID NO:3) Encoding for Production of the Human Mox2
Protein (SEQ ID NO:4)

Note that the mox2 protein, as described herein, was
described in U.S. Provisional Application Serial No. 60/149,332
as mox3.

A BLAST search was carried out using the sequence of mox1 as
a query sequence. The sequence identified by this search was a
sequence present in the GenBank database that contains regions
of homology with mox1 and gp91phox. The GenBank sequence located
in the search was a 90.6 kb sequenced region of human chromosome
6 (6q25.1-26) that was reported as a GenBank direct submission
dated February 9, 1999 and given the Accession No. AL031773.
Sequencing was carried out as part of the human genome
sequencing project by S. Palmer, at Sanger Centre, in Hinxton,
Cambridgeshire, UK. The GenBank sequence was reported as being
similar to "Cytochrome B" and was not reported as having any
homology or relation to a mox protein. The sequence contained a
theoretical amino acid sequence that was derived by computer
using an algorithm that predicted intron/exon boundaries and
coding regions. This predicted region contained a 545 amino
acid sequence that was 56% identical to mox1 and 58% identical
to gp91phox.

In the present invention, based on the GenBank genomic
sequence and the homologies described above, several specific
primers were designed and used to determine the tissue
expression patterns of a novel mox protein, mox2, using Human
Multiple Tissue PCR Panels (Clontech, Palo Alto, CA). The
primers were as follows: Primer 1:
5'-CCTGACAGATGTATTTCACTACCCAG-3' (SEQ ID NO:49); Primer 2:
5'-GGATCGGAGTCACTCCCTTCGCTG-3' (SEQ ID NO:50); Primer 3:
5'-CTAGAAGCTCTCCTTGTTGTAATAGA-3' (SEQ ID NO:51); Primer 4:
5'-ATGAACACCTCTGGGGTCAGCTGA-3' (SEQ ID NO:52). It was
determined that mox2 is expressed primarily in fetal tissues,
with highest expression in fetal kidney, with expression also
seen in fetal liver, fetal lung, fetal brain, fetal spleen and
fetal thymus. Among 16 adult tissues tested, mox2 expression was
seen in brain, kidney, colon and lung, although levels of
expression appeared to be very low.

Additionally, the 5' RACE (RACE = Rapid Amplification of
cDNA Ends) and 3' RACE techniques were used to complete the
sequence of the 5' and 3' regions of mox2. (5' RACE kit and 3'
RACE kit were from Clontech, Palo Alto, CA and are more fully
described in Frohman et al. (1988) Proc. Natl. Acad. Sci. USA
85, 8998-9002.) The 5' RACE and 3' RACE techniques were carried
out using a human fetal kidney library (Marathon-Ready cDNA
library, Cat. #7423-1), using the following specific primers:
5'-RACE: Primer 4: 5'-ATGAACACCTCTGGGGTCAGCTGA-3' (SEQ ID
NO:53); Primer 5: 5'-GTCCTCTGCAGCATTGTTCCTCTTA-3' (SEQ ID
NO:54); 3'-RACE: Primer 1: 5'-CCTGACAGATGTATTTCACTACCCAG-3'
(SEQ ID NO:55); Primer 2: 5'-GGATCGGAGTCACTCCCTTCGCTG-3' (SEQ
ID NO:56). The RACE procedures were successful in completing the
5' sequence and in confirming the 3' sequence. The complete
coding sequence of mox2 is shown in SEQ ID NO:2, while the
predicted amino acid sequence of mox2 is shown in SEQ ID NO:4.

In comparing the sequences of the present invention 
to the predicted coding regions of the GenBank sequence, the 
GenBank sequence did not contain a start codon, appeared to be 
missing approximately 45 base pairs at the N-terminus, and 
contained one other major difference in the predicted coding 
region which could have been due to inaccurate computer 
prediction of intron/exon boundaries. 

Example 18 

Sequence Analysis and Partial Cloning of the Human Duox2
cDNA (SEQ ID NO:47) Encoding for Production of the Human
Duox2 Protein (SEQ ID NO:48)

A partial cDNA clone of duox2 was obtained as follows. A
535-base portion of an expressed sequence tag (EST zc92h03.rl;
Genbank accession no. W52750) from human pancreatic islet was
identified using the human gp91phox amino-acid sequence as a
query in a BLAST search. The bacterial strain #595758 containing
the EST sequence zc92h03.rl in the pBluescript SK-vector was
purchased from ATCC (Rockville, MD). The DNA inserted into the
pBluescript SK-vector was further sequenced using T7 and T3
vector promoters as well as sequence specific internal primers.
The EST encoded 440 amino acids showing a 24.4% identity to
gp91phox, including a stop codon corresponding to the C-terminus
of gp91phox. 5'-RACE was carried out using mRNA obtained from
human colon carcinoma cells (CaCo2) and the Marathon cDNA
Amplification Kit (ClonTech, Palo Alto). The following
gene-specific primers were used for this procedure:
5'-GAAGTGGTGGGAGGCGAAGACATA-3' (SEQ ID NO:26) and
5'-CCTGTCATACCTGGGACGGTCTGG-3' (SEQ ID NO:27).

The 5'-RACE yielded an additional 2 kilobases of sequenced
DNA but this region did not contain the start codon. To complete
the sequence of the 5' and 3' regions of duox2, 5'-RACE and
3'-RACE were carried out using human adult pancreas mRNA
(Clontech, Palo Alto, CA) with the 5' RACE System for Rapid
Amplification of cDNA Ends version 2.0 kit (Gibco BRL,
Gaithersburg, MD). PCR using the following specific primers
resulted in a total predicted amino acid sequence of about 1000
residues: 5'-RACE: Primer 3: 5'-GAGCACAGTGAGATGCCTGTTCAG-3'
(SEQ ID NO:28); Primer 4: 5'-GGAAGGCAGCAGAGAGCAATGATG-3' (SEQ
ID NO:29) (for nested PCR); 3'-RACE: Primer 5:
5'-ACATCTGCGAGCGGCACTTCCAGA-3' (SEQ ID NO:30); Primer 6:
5'-AGCTCGTCAACAGGCAGGACCGAGC-3' (SEQ ID NO:31) (for nested
PCR).

Example 19 

Sequence Analysis and Cloning of the Human Duox1 cDNA
(SEQ ID NO:45) Encoding for Production of the Human Duox1
Protein (SEQ ID NO:46)

A cDNA clone of duox1 was obtained as follows. A homologous
357-base portion of an expressed sequence tag (EST nr80dl2.sl;
Genbank accession no. AA641653) from an invasive human prostate
was identified by using the partial duox2 predicted amino-acid
sequence described above as a query in a BLAST search. The
bacterial strain #1441736 containing the EST sequence nr80dl2.sl
in the pBluescript SK-vector was purchased from ATCC (Rockville,
MD). The DNA inserted into the pBluescript SK-vector was further
sequenced using T7 and T3 vector promoters as well as sequence
specific internal primers. The EST insert encoded 673 amino
acids with no start or stop codons present. Northern Blot
analysis of duox1 indicated the gene was about 5.5 kilobase
pairs. To complete the sequence of 5' and 3' regions of duox1,
5'-RACE and 3'-RACE were carried out using human adult lung
mRNA (Clontech, Palo Alto, CA) with the 5' RACE System for Rapid
Amplification of cDNA Ends version 2.0 kit (Gibco BRL,
Gaithersburg, MD). The RACE procedure was carried out using the
following specific primers: 5'-RACE: Primer 5:
5'-GCAGTGCATCCACATCTrCAGCAC-3' (SEQ ID NO:32); Primer 6:
5'-GAGAGCTCTGGAGACACTTGAGTTC-3' (SEQ ID NO:33) (for nested
PCR); 3'-RACE: Primer 7: 5'-CATGTTCTCTCTGGCTGACAAG-3' (SEQ ID
NO:34); Primer 8: 5'-CACAATAGCGAGCTCCGCTTCACGC-3' (SEQ ID
NO:35) (for nested PCR). RACE procedures were successful in
completing the 5' sequence and the 3' sequence of duox1. The
open reading frame is approximately 4563 base pairs.

Example 20 

Tissue Expression of Duox1 and Duox2

Based on the duox1 sequence data, several specific primers
were designed (Primer 1a: 5'-GCAGGACATCAACCCTGCACTCTC-3' (SEQ
ID NO:36); Primer 2a: 5'-AATGACACTGTACTGGAGGCCACAG-3' (SEQ ID
NO:57); Primer 3a: 5'-CTGCCATCTACCACACGGATCTGC-3' (SEQ ID
NO:58); Primer 4a: 5'-CTTGCCATTCCAAAGCTTCCATGC-3' (SEQ ID
NO:59)) and used to determine the tissue expression patterns of
duox1 using Human Multiple Tissue PCR Panels (Clontech, Palo
Alto, CA). It was determined that duox1 is expressed primarily
in lung, testis, placenta, prostate, pancreas, fetal heart,
fetal kidney, fetal liver, fetal lung, fetal skeletal muscle and
thymus, with highest expression in adult and fetal lung. Among
16 adult tissues and 8 fetal tissues tested, duox1 expression in
brain, heart, kidney, colon, ovary, thymus, fetal brain and
fetal spleen appeared to be low.

Two duox2 specific primers were also used to determine the
tissue expression patterns of duox2 using Human Multiple Tissue
PCR (polymerase chain reaction) Panels (Clontech, Palo Alto,
CA). (Primer 1b: 5'-GTACAAGTCAGGACAGTGGGTGCG-3' (SEQ ID NO:60);
Primer 2b: 5'-TGGATGATGTCAGCCAGCCACTCA-3' (SEQ ID NO:61)).
Duox2 is expressed primarily in lung, pancreas, placenta, colon,
prostate, testis and fetal tissues, with highest expression in
adult lung and fetal tissues. Among 16 adult tissues and 8 fetal
tissues tested, duox2 expression in brain, heart, kidney, liver,
skeletal muscle, thymus and fetal brain appeared to be low.

EXAMPLE 21

Role of Duox1 and Duox2 in Collagen Crosslinking

To investigate a possible role for the human duox1 and
duox2, the model organism Caenorhabditis elegans and a new
reverse genetic tool, RNA interference (RNAi), were used to
"knock out" the homologues of duox in this organism (Fire et al.
(1998) Nature 391, 806-811). This technique involved injection
of double stranded RNA encoding a segment of Ce-duox1 or
Ce-duox2 into gonads of C. elegans N2 hermaphrodites. Injected
worms were then allowed to lay eggs, and the harvested eggs were
allowed to develop and the F1 progeny were scored for
phenotypes. This procedure has been documented to "knock-out"
the expression of the gene of interest (Fire et al. (1998)
Nature 391, 806-811).

In the case of Ce-duox1 and Ce-duox2, the knockout animals
resulted in a complex phenotype including worms with large
superficial blisters, short or "dumpy" worms, worms with
locomotion disorders, and worms with retained eggs and/or
larvae. Because of the high identity between Ce-duox1 and
Ce-duox2, three different RNA constructs were predicted to knock
out either both genes or Ce-duox2 alone. In all cases,
essentially the same group of phenotypes was obtained. Most or
all of these phenotypes had been described previously in C.
elegans mutated in the collagen biosynthetic pathway. C. elegans
has an extracellular structure known as the cuticle, a complex
sheath composed largely of cross-linked collagen, which
functions as the exoskeleton of the nematode. Cross-linking of
collagen in nematodes occurs in part by cross-linking tyrosine
residues, and peroxidases such as sea urchin ovoperoxidase and
human myeloperoxidase have previously been shown to be capable
of carrying out this reaction.

Based upon the similarities of the phenotypes obtained, the
Ce-duox1/2 knockout worms were examined for the presence of
dityrosine linkages, using an HPLC methodology (Andersen, S.O.
(1966) Acta Physiol. Scand. 66, Suppl. 263-265; Abdelrahim et
al. (1997) J. Chromatogr. B Biomed. Sci. Appl. 696, 175-182). It
was determined that dityrosine linkages, while easily detected
in the wild type worms, were almost completely lacking in the
knockout worms. Thus, an inability to catalyze dityrosine
cross-linking accounts for the phenotype of C. elegans failing
to express Ce-duox1/2. These data support the concept that the
duox enzymes in higher organisms can probably function in a
similar manner to modulate the extracellular milieu, possibly
the extracellular matrix and/or the basement membrane.

All patents, publications and abstracts cited above 
are incorporated herein by reference in their entirety. It should 
be understood that the foregoing relates only to preferred 
embodiments of the present invention and that numerous 
modifications or alterations may be made therein without
departing from the spirit and the scope of the present invention
as defined in the following claims.

CLAIMS

1. A protein capable of stimulating superoxide production,
wherein the protein comprises mox or duox, a fragment thereof
or a conservative substitution thereof.

2. The protein of Claim 1, wherein the protein, the fragment 
thereof, or the conservative substitution thereof comprises the 
amino acid sequence of SEQ ID NO:2, SEQ ID NO:4, SEQ ID 
NO:21, SEQ ID NO:42, SEQ ID NO:46, or SEQ ID NO:48, a 
fragment thereof, or a conservative substitution thereof. 



3. A nucleotide sequence encoding for the protein, the
fragment thereof or the conservative substitution thereof as
recited in Claim 1.

4. The nucleotide sequence of Claim 3, wherein the
nucleotide sequence comprises SEQ ID NO:1, SEQ ID NO:3,
SEQ ID NO:22, SEQ ID NO:41, SEQ ID NO:45, or SEQ ID
NO:47, a fragment thereof, or a conservative substitution
thereof.

5. A vector, wherein the vector comprises a nucleotide
sequence encoding for the protein, the fragment thereof or the
conservative substitution thereof, as recited in Claim 1.

6. The vector of Claim 5 wherein the nucleotide sequence
comprises SEQ ID NO:1, SEQ ID NO:3, SEQ ID NO:22, SEQ
ID NO:41, SEQ ID NO:45, or SEQ ID NO:47, a fragment
thereof, or a conservative substitution thereof.

7. A cell containing the vector of Claim 5. 

8. A cell containing the vector of Claim 6. 

9. An antibody, wherein the antibody is capable of binding
to the protein, the fragment thereof, or the conservative
substitution thereof, as recited in Claim 1.

10. The antibody of Claim 9, wherein the protein, the 
fragment thereof, or the conservative substitution thereof, has 
the amino acid sequence of SEQ ID NO:2, SEQ ID NO:4, SEQ 
ID NO:21, SEQ ID NO:42, SEQ ID NO:46, or SEQ ID NO:48, 
a fragment thereof, or a conservative substitution thereof. 



11. A method of stimulating superoxide formation
comprising administration, in vitro or in vivo, of a composition
comprising the protein, the fragment thereof, or the
conservative substitution thereof of Claim 1 in a
pharmaceutically acceptable carrier.

12. The method of Claim 11, wherein the protein, the
fragment thereof, or the conservative substitution thereof,
comprises the amino acid sequence of SEQ ID NO:2, SEQ ID
NO:4, SEQ ID NO:21, SEQ ID NO:42, SEQ ID NO:46, or SEQ
ID NO:48, a fragment thereof, or a conservative substitution
thereof.

13. A method of stimulating superoxide formation
comprising administration, in vitro or in vivo, of a composition
comprising the vector of Claim 5 in a pharmaceutically
acceptable carrier.

14. A method of stimulating superoxide formation
comprising administration, in vitro or in vivo, of a composition
comprising the vector of Claim 6 in a pharmaceutically
acceptable carrier.

15. A method for determining the activity of a drug
comprising measuring the activity of the protein, the fragment
thereof or the conservative substitution thereof, as recited in
Claim 1, to stimulate superoxide production following
administration of the drug.

16. The method of Claim 15, wherein the protein, the 
fragment thereof or the conservative substitution thereof 
comprises the amino acid sequence of SEQ ID NO:2, SEQ ID 
NO:4, SEQ ID NO:21, SEQ ID NO:42, SEQ ID NO:46, or SEQ 
ID NO:48, a fragment thereof, or a conservative substitution 
thereof.

SEQUENCE LISTING

<110> Emory University 

<120> Novel Mitogenic Regulators 

<130> 05501-0103WP 

<140> 
<141> 

<160> 61 

<170> PatentIn Ver. 2.0

<210> 1 
<211> 2609 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 

<222> (207)..(1901)
<400> 1 

gctgatagca cagttctgtc cagagaagga aggcggaata aacttattca ttcccaggaa 60 

ctcttggggt aggtgtgtgt ttttcacatc ttaaaggctc acagaccctg cgctggacaa 120 

atgttccatt cctgaaggac ctctccagaa tccggattgc tgaatcttcc ctgttgccta 180 

gaagggctcc aaaccacctc ttgaca atg gga aac tgg gtg gtt aac cac tgg 233 

Met Gly Asn Trp Val Val Asn His Trp 
1 5 

ttt tca gtt ttg ttt ctg gtt gtt tgg tta ggg ctg aat gtt ttc ctg 281
Phe Ser Val Leu Phe Leu Val Val Trp Leu Gly Leu Asn Val Phe Leu
10 15 20 25

ttt gtg gat gcc ttc ctg aaa tat gag aag gcc gac aaa tac tac tac 329
Phe Val Asp Ala Phe Leu Lys Tyr Glu Lys Ala Asp Lys Tyr Tyr Tyr
30 35 40

aca aga aaa atc ctt ggg tca aca ttg gcc tgt gcc cga gcg tct gct 377
Thr Arg Lys Ile Leu Gly Ser Thr Leu Ala Cys Ala Arg Ala Ser Ala
45 50 55

ctc tgc ttg aat ttt aac agc acg ctg atc ctg ctt cct gtg tgt cgc 425
Leu Cys Leu Asn Phe Asn Ser Thr Leu Ile Leu Leu Pro Val Cys Arg
60 65 70

aat ctg ctg tcc ttc ctg agg ggc acc tgc tca ttt tgc agc cgc aca 473
Asn Leu Leu Ser Phe Leu Arg Gly Thr Cys Ser Phe Cys Ser Arg Thr
75 80 85

ctg aga aag caa ttg gat cac aac ctc acc ttc cac aag ctg gtg gcc 521
Leu Arg Lys Gln Leu Asp His Asn Leu Thr Phe His Lys Leu Val Ala
90 95 100 105

tat atg atc tgc cta cat aca gct att cac atc att gca cac ctg ttt 569
Tyr Met Ile Cys Leu His Thr Ala Ile His Ile Ile Ala His Leu Phe
110 115 120

aac ttt gac tgc tat agc aga agc cga cag gcc aca gat ggc tcc ctt 617
Asn Phe Asp Cys Tyr Ser Arg Ser Arg Gln Ala Thr Asp Gly Ser Leu
125 130 135

gcc tcc att ctc tcc agc cta tct cat gat gag aaa aag ggg ggt tct 665
Ala Ser Ile Leu Ser Ser Leu Ser His Asp Glu Lys Lys Gly Gly Ser
140 145 150

tgg cta aat ccc atc cag tcc cga aac acg aca gtg gag tat gtg aca 713
Trp Leu Asn Pro Ile Gln Ser Arg Asn Thr Thr Val Glu Tyr Val Thr
155 160 165

ttc acc agc gtt gct ggt ctc act gga gtg atc atg aca ata gcc ttg 761
Phe Thr Ser Val Ala Gly Leu Thr Gly Val Ile Met Thr Ile Ala Leu
170 175 180 185

att ctc atg gta act tca gct act gag ttc atc cgg agg agt tat ttt 809
Ile Leu Met Val Thr Ser Ala Thr Glu Phe Ile Arg Arg Ser Tyr Phe
190 195 200

gaa gtc ttc tgg tat act cac cac ctt ttt atc ttc tat atc ctt ggc 857
Glu Val Phe Trp Tyr Thr His His Leu Phe Ile Phe Tyr Ile Leu Gly
205 210 215

tta ggg att cac ggc att ggt gga att gtc cgg ggt caa aca gag gag 905
Leu Gly Ile His Gly Ile Gly Gly Ile Val Arg Gly Gln Thr Glu Glu
220 225 230

agc atg aat gag agt cat cct cgc aag tgt gca gag tct ttt gag atg 953
Ser Met Asn Glu Ser His Pro Arg Lys Cys Ala Glu Ser Phe Glu Met
235 240 245

tgg gat gat cgt gac tcc cac tgt agg cgc cct aag ttt gaa ggg cat 1001
Trp Asp Asp Arg Asp Ser His Cys Arg Arg Pro Lys Phe Glu Gly His
250 255 260 265



ccc cct gag tct tgg aag tgg atc ctt gca ccg gtc att ctt tat atc 1049
Pro Pro Glu Ser Trp Lys Trp Ile Leu Ala Pro Val Ile Leu Tyr Ile
270 275 280

tgt gaa agg atc ctc cgg ttt tac cgc tcc cag cag aag gtt gtg att 1097
Cys Glu Arg Ile Leu Arg Phe Tyr Arg Ser Gln Gln Lys Val Val Ile
285 290 295

acc aag gtt gtt atg cac cca tcc aaa gtt ttg gaa ttg cag atg aac 1145
Thr Lys Val Val Met His Pro Ser Lys Val Leu Glu Leu Gln Met Asn
300 305 310

aag cgt ggc ttc agc atg gaa gtg ggg cag tat atc ttt gtt aat tgc 1193
Lys Arg Gly Phe Ser Met Glu Val Gly Gln Tyr Ile Phe Val Asn Cys
315 320 325

ccc tca atc tct ctc ctg gaa tgg cat cct ttt act ttg acc tct gct 1241
Pro Ser Ile Ser Leu Leu Glu Trp His Pro Phe Thr Leu Thr Ser Ala
330 335 340 345

cca gag gaa gat ttc ttc tcc att cat atc cga gca gca ggg gac tgg 1289
Pro Glu Glu Asp Phe Phe Ser Ile His Ile Arg Ala Ala Gly Asp Trp
350 355 360

aca gaa aat ctc ata agg gct ttc gaa caa caa tat tca cca att ccc 1337
Thr Glu Asn Leu Ile Arg Ala Phe Glu Gln Gln Tyr Ser Pro Ile Pro
365 370 375

agg att gaa gtg gat ggt ccc ttt ggc aca gcc agt gag gat gtt ttc 1385
Arg Ile Glu Val Asp Gly Pro Phe Gly Thr Ala Ser Glu Asp Val Phe
380 385 390

cag tat gaa gtg gct gtg ctg gtt gga gca gga att ggg gtc acc ccc 1433
Gln Tyr Glu Val Ala Val Leu Val Gly Ala Gly Ile Gly Val Thr Pro
395 400 405

ttt gct tct atc ttg aaa tcc atc tgg tac aaa ttc cag tgt gca gac 1481
Phe Ala Ser Ile Leu Lys Ser Ile Trp Tyr Lys Phe Gln Cys Ala Asp
410 415 420 425

cac aac ctc aaa aca aaa aag atc tat ttc tac tgg atc tgc agg gag 1529
His Asn Leu Lys Thr Lys Lys Ile Tyr Phe Tyr Trp Ile Cys Arg Glu
430 435 440

aca ggt gcc ttt tcc tgg ttc aac aac ctg ttg act tcc ctg gaa cag 1577
Thr Gly Ala Phe Ser Trp Phe Asn Asn Leu Leu Thr Ser Leu Glu Gln
445 450 455

gag atg gag gaa tta ggc aaa gtg ggt ttt cta aac tac cgt ctc ttc 1625
Glu Met Glu Glu Leu Gly Lys Val Gly Phe Leu Asn Tyr Arg Leu Phe
460 465 470

ctc acc gga tgg gac agc aat att gtt ggt cat gca gca tta aac ttt 1673
Leu Thr Gly Trp Asp Ser Asn Ile Val Gly His Ala Ala Leu Asn Phe
475 480 485

gac aag gcc act gac atc gtg aca ggt ctg aaa cag aaa acc tcc ttt 1721
Asp Lys Ala Thr Asp Ile Val Thr Gly Leu Lys Gln Lys Thr Ser Phe
490 495 500 505

ggg aga cca atg tgg gac aat gag ttt tct aca ata gct acc tcc cac 1769
Gly Arg Pro Met Trp Asp Asn Glu Phe Ser Thr Ile Ala Thr Ser His
510 515 520

ccc aag tct gta gtg gga gtt ttc tta tgt ggc cct cgg act ttg gca 1817
Pro Lys Ser Val Val Gly Val Phe Leu Cys Gly Pro Arg Thr Leu Ala
525 530 535

aag agc ctg cgc aaa tgc tgt cac cga tat tcc agt ctg gat cct aga 1865
Lys Ser Leu Arg Lys Cys Cys His Arg Tyr Ser Ser Leu Asp Pro Arg
540 545 550

aag gtt caa ttc tac ttc aac aaa gaa aat ttt tga gttataggaa 1911
Lys Val Gln Phe Tyr Phe Asn Lys Glu Asn Phe
555 560 565

taaggacggt aatctgcatt ttgtctcttt gtatcttcag taattgagtt ataggaataa 1971

ggacggtaat ctgcattttg tctctttgta tcttcagtaa tttacttggt ctcntcaggt 2031

ttgancagtc actttaggat aagaatgtgc ctctcaagcc ttgactccct ggtattcttt 2091

ttttgattgc attcaacttc gttacttgag cttcagcaac ttaagaactt ctgaagttct 2151

taaagttctg aanttcttaa agcccatgga tcctttctca gaaaaataac tgtaaatctt 2211

tctggacagc catgactgta gcaaggcttg atagcagaag tttggtggtt canaattata 2271

caactaatcc caggtgattt tatcaattcc agtgttacca tctcctgagt tttggtttgt 2331

aatcttttgt ccctcccacc cccacagaag attttaagta gggtgacttt ttaaataaaa 2391

atttattgaa taattaatga taaaacataa taataaacat aaataataaa caaaattacc 2451
gagaacccca tccccatata acaccaacag tgtacatgtt tactgtcact tttgatatgg 2511
tttatccagt gtgaacagca atttattatt tttgctcatc aaaaaataaa ggattttttt 2571 
tcacttgaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaa 2609 



<210> 2 
<211> 564 
<212> PRT 

<213> Homo sapiens 
<400> 2 

Met Gly Asn Trp Val Val Asn His Trp Phe Ser Val Leu Phe Leu Val 
1 5 10 15 

Val Trp Leu Gly Leu Asn Val Phe Leu Phe Val Asp Ala Phe Leu Lys 
20 25 30 

Tyr Glu Lys Ala Asp Lys Tyr Tyr Tyr Thr Arg Lys Ile Leu Gly Ser
35 40 45

Thr Leu Ala Cys Ala Arg Ala Ser Ala Leu Cys Leu Asn Phe Asn Ser
50 55 60

Thr Leu Ile Leu Leu Pro Val Cys Arg Asn Leu Leu Ser Phe Leu Arg
65 70 75 80

Gly Thr Cys Ser Phe Cys Ser Arg Thr Leu Arg Lys Gln Leu Asp His
85 90 95

Asn Leu Thr Phe His Lys Leu Val Ala Tyr Met Ile Cys Leu His Thr
100 105 110

Ala Ile His Ile Ile Ala His Leu Phe Asn Phe Asp Cys Tyr Ser Arg
115 120 125

Ser Arg Gln Ala Thr Asp Gly Ser Leu Ala Ser Ile Leu Ser Ser Leu
130 135 140

Ser His Asp Glu Lys Lys Gly Gly Ser Trp Leu Asn Pro Ile Gln Ser
145 150 155 160

Arg Asn Thr Thr Val Glu Tyr Val Thr Phe Thr Ser Val Ala Gly Leu
165 170 175

Thr Gly Val Ile Met Thr Ile Ala Leu Ile Leu Met Val Thr Ser Ala
180 185 190

Thr Glu Phe Ile Arg Arg Ser Tyr Phe Glu Val Phe Trp Tyr Thr His
195 200 205

His Leu Phe Ile Phe Tyr Ile Leu Gly Leu Gly Ile His Gly Ile Gly
210 215 220

Gly Ile Val Arg Gly Gln Thr Glu Glu Ser Met Asn Glu Ser His Pro
225 230 235 240

Arg Lys Cys Ala Glu Ser Phe Glu Met Trp Asp Asp Arg Asp Ser His 
245 250 255 

Cys Arg Arg Pro Lys Phe Glu Gly His Pro Pro Glu Ser Trp Lys Trp 
260 265 270 

Ile Leu Ala Pro Val Ile Leu Tyr Ile Cys Glu Arg Ile Leu Arg Phe
275 280 285

Tyr Arg Ser Gln Gln Lys Val Val Ile Thr Lys Val Val Met His Pro
290 295 300

Ser Lys Val Leu Glu Leu Gln Met Asn Lys Arg Gly Phe Ser Met Glu
305 310 315 320

Val Gly Gln Tyr Ile Phe Val Asn Cys Pro Ser Ile Ser Leu Leu Glu
325 330 335

Trp His Pro Phe Thr Leu Thr Ser Ala Pro Glu Glu Asp Phe Phe Ser
340 345 350

Ile His Ile Arg Ala Ala Gly Asp Trp Thr Glu Asn Leu Ile Arg Ala
355 360 365

Phe Glu Gln Gln Tyr Ser Pro Ile Pro Arg Ile Glu Val Asp Gly Pro
370 375 380

Phe Gly Thr Ala Ser Glu Asp Val Phe Gln Tyr Glu Val Ala Val Leu
385 390 395 400

Val Gly Ala Gly Ile Gly Val Thr Pro Phe Ala Ser Ile Leu Lys Ser
405 410 415

Ile Trp Tyr Lys Phe Gln Cys Ala Asp His Asn Leu Lys Thr Lys Lys
420 425 430

Ile Tyr Phe Tyr Trp Ile Cys Arg Glu Thr Gly Ala Phe Ser Trp Phe
435 440 445

Asn Asn Leu Leu Thr Ser Leu Glu Gln Glu Met Glu Glu Leu Gly Lys
450 455 460

Val Gly Phe Leu Asn Tyr Arg Leu Phe Leu Thr Gly Trp Asp Ser Asn
465 470 475 480

Ile Val Gly His Ala Ala Leu Asn Phe Asp Lys Ala Thr Asp Ile Val
485 490 495

Thr Gly Leu Lys Gln Lys Thr Ser Phe Gly Arg Pro Met Trp Asp Asn
500 505 510

Glu Phe Ser Thr Ile Ala Thr Ser His Pro Lys Ser Val Val Gly Val
515 520 525

Phe Leu Cys Gly Pro Arg Thr Leu Ala Lys Ser Leu Arg Lys Cys Cys
530 535 540

His Arg Tyr Ser Ser Leu Asp Pro Arg Lys Val Gln Phe Tyr Phe Asn
545 550 555 560

Lys Glu Asn Phe 
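The coding ranges given in the <222> fields can be cross-checked against the protein lengths given in the corresponding <211> fields. The following is a minimal sketch of that arithmetic, assuming (as the printed sequences suggest) that each CDS range is 1-based, inclusive, and ends with the stop codon; the function name `protein_length_from_cds` is illustrative only and is not part of the listing.

```python
def protein_length_from_cds(start, end):
    """Residue count implied by a 1-based, inclusive CDS range.

    Assumes the range ends with a stop codon, which is not counted
    as a residue.
    """
    span = end - start + 1
    if span <= 0 or span % 3 != 0:
        raise ValueError("CDS %d..%d is not a whole number of codons" % (start, end))
    return span // 3 - 1  # subtract the terminal stop codon


# <222> ranges and <211> protein lengths as printed in this listing.
records = {
    "SEQ ID NO:1 -> SEQ ID NO:2": ((207, 1901), 564),
    "SEQ ID NO:3 -> SEQ ID NO:4": ((104, 1810), 568),
}

for label, ((start, end), listed_length) in records.items():
    implied = protein_length_from_cds(start, end)
    status = "consistent" if implied == listed_length else "inconsistent"
    print(label, implied, status)
```

A mismatch here would point to a transcription error in either the CDS range or the protein length rather than to any property of the proteins themselves.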



<210> 3 

<211> 2044 

<212> DNA

<213> Homo sapiens 

<220> 
<221> CDS 

<222> (104)..(1810)
<400> 3 

caaagacaaa ataatttact agggaagccc ttactaacga cccaacatcc agacacaggt 60 

gagggagaag aaatttcctg acagccgaag agcaacaagt atc atg atg ggg tgc 115

Met Met Gly Cys
1

tgg att ttg aat gag ggt ctc tcc acc ata tta gta ctc tca tgg ctg 163
Trp Ile Leu Asn Glu Gly Leu Ser Thr Ile Leu Val Leu Ser Trp Leu
5 10 15 20

gga ata aat ttt tat ctg ttt att gac acg ttc tac tgg tat gaa gag 211
Gly Ile Asn Phe Tyr Leu Phe Ile Asp Thr Phe Tyr Trp Tyr Glu Glu
25 30 35

gag gag tct ttc cat tac aca cga gtt att ttg ggt tca aca ctg gct 259
Glu Glu Ser Phe His Tyr Thr Arg Val Ile Leu Gly Ser Thr Leu Ala
40 45 50

tgg gca cga gca tcc gca ctg tgc ctg aat ttt aac tgc atg cta att 307
Trp Ala Arg Ala Ser Ala Leu Cys Leu Asn Phe Asn Cys Met Leu Ile
55 60 65

cta ata cct gtc agt cga aac ctt att tca ttc ata aga gga aca agt 355
Leu Ile Pro Val Ser Arg Asn Leu Ile Ser Phe Ile Arg Gly Thr Ser
70 75 80

att tgc tgc aga gga ccg tgg agg agg caa tta gac aaa aac ctc aga 403
Ile Cys Cys Arg Gly Pro Trp Arg Arg Gln Leu Asp Lys Asn Leu Arg
85 90 95 100

ttt cac aaa ctg gtc gcc tat ggg ata gct gtt aat gca acc atc cac 451
Phe His Lys Leu Val Ala Tyr Gly Ile Ala Val Asn Ala Thr Ile His
105 110 115

atc gtg gcg cat ttc ttc aac ctg gaa cgc tac cac tgg agc cag tcc 499
Ile Val Ala His Phe Phe Asn Leu Glu Arg Tyr His Trp Ser Gln Ser
120 125 130

gag gag gcc cag gga ctt ctg gcc gca ctt tcc aag ctg ggc aac acc 547
Glu Glu Ala Gln Gly Leu Leu Ala Ala Leu Ser Lys Leu Gly Asn Thr
135 140 145

cct aac gag agc tac ctc aac cct gtc cgg acc ttc ccc aca aac aca 595
Pro Asn Glu Ser Tyr Leu Asn Pro Val Arg Thr Phe Pro Thr Asn Thr
150 155 160

acc act gaa ttg cta agg aca ata gca ggc gtc acc ggt ctg gtg atc 643
Thr Thr Glu Leu Leu Arg Thr Ile Ala Gly Val Thr Gly Leu Val Ile
165 170 175 180

tct ctg gct tta gtc ttg atc atg acc tcg tca act gag ttc atc aga 691
Ser Leu Ala Leu Val Leu Ile Met Thr Ser Ser Thr Glu Phe Ile Arg
185 190 195

cag gcc tcc tat gag ttg ttc tgg tac aca cac cat gtt ttc atc gtc 739
Gln Ala Ser Tyr Glu Leu Phe Trp Tyr Thr His His Val Phe Ile Val
200 205 210

ttc ttt ctc agc ctg gcc atc cat ggg acg ggt cgg att gtt cga ggc 787
Phe Phe Leu Ser Leu Ala Ile His Gly Thr Gly Arg Ile Val Arg Gly
215 220 225

caa acc caa gac agt ctc tct ctg cac aac atc acc ttc tgt aga gac 835
Gln Thr Gln Asp Ser Leu Ser Leu His Asn Ile Thr Phe Cys Arg Asp
230 235 240

cgc tat gca gaa tgg cag aca gtg gcc caa tgc ccc gtg cct caa ttt 883
Arg Tyr Ala Glu Trp Gln Thr Val Ala Gln Cys Pro Val Pro Gln Phe
245 250 255 260

tct ggc aag gaa ccc tcg gct tgg aaa tgg att tta ggc cct gtg gtc 931
Ser Gly Lys Glu Pro Ser Ala Trp Lys Trp Ile Leu Gly Pro Val Val
265 270 275

ttg tat gca tgt gaa aga ata att agg ttc tgg cga ttt caa caa gaa 979
Leu Tyr Ala Cys Glu Arg Ile Ile Arg Phe Trp Arg Phe Gln Gln Glu
280 285 290

gtt gtc att acc aag gtg gta agc cac ccc tct gga gtc ctg gaa ctt 1027
Val Val Ile Thr Lys Val Val Ser His Pro Ser Gly Val Leu Glu Leu
295 300 305

cac atg aaa aag cgt ggc ttt aaa atg gcg cca ggg cag tac atc ttg 1075
His Met Lys Lys Arg Gly Phe Lys Met Ala Pro Gly Gln Tyr Ile Leu
310 315 320

gtg cag tgc cca gcc ata tct tcg ctg gag tgg cac ccc ttc acc ctt 1123
Val Gln Cys Pro Ala Ile Ser Ser Leu Glu Trp His Pro Phe Thr Leu
325 330 335 340

acc tct gcc ccc cag gaa gac ttt ttc agc gtg cac atc cgg gca gca 1171
Thr Ser Ala Pro Gln Glu Asp Phe Phe Ser Val His Ile Arg Ala Ala
345 350 355

gga gac tgg aca gca gcg cta ctg gag gcc ttt ggg gca gag gga cag 1219
Gly Asp Trp Thr Ala Ala Leu Leu Glu Ala Phe Gly Ala Glu Gly Gln
360 365 370

gcc ctc cag gag ccc tgg agc ctg cca agg ctg gca gtg gac ggg ccc 1267
Ala Leu Gln Glu Pro Trp Ser Leu Pro Arg Leu Ala Val Asp Gly Pro
375 380 385

ttt gga act gcc ctg aca gat gta ttt cac tac cca gtg tgt gtg tgc 1315
Phe Gly Thr Ala Leu Thr Asp Val Phe His Tyr Pro Val Cys Val Cys
390 395 400

gtt gcc gcg ggg atc gga gtc act ccc ttc gct gct ctt ctg aaa tct 1363
Val Ala Ala Gly Ile Gly Val Thr Pro Phe Ala Ala Leu Leu Lys Ser
405 410 415 420

ata tgg tac aaa tgc agt gag gca cag acc cca ctg aag ctg agc aag 1411
Ile Trp Tyr Lys Cys Ser Glu Ala Gln Thr Pro Leu Lys Leu Ser Lys
425 430 435

gtg tat ttc tac tgg att tgc cgg gat gca aga gct ttt gag tgg ttt 1459
Val Tyr Phe Tyr Trp Ile Cys Arg Asp Ala Arg Ala Phe Glu Trp Phe
440 445 450

gct gat ctc tta ctc tcc ctg gaa aca cgg atg agt gag cag ggg aaa 1507
Ala Asp Leu Leu Leu Ser Leu Glu Thr Arg Met Ser Glu Gln Gly Lys
455 460 465

act cac ttt ctg agt tat cat ata ttt ctt acc ggc tgg gat gaa aat 1555
Thr His Phe Leu Ser Tyr His Ile Phe Leu Thr Gly Trp Asp Glu Asn
470 475 480

cag gct ctt cac ata gct tta cac tgg gac gaa aat act gac gtg att 1603
Gln Ala Leu His Ile Ala Leu His Trp Asp Glu Asn Thr Asp Val Ile
485 490 495 500

aca ggc tta aag cag aag acc ttc tat ggg agg ccc aac tgg aac aat 1651
Thr Gly Leu Lys Gln Lys Thr Phe Tyr Gly Arg Pro Asn Trp Asn Asn
505 510 515

gag ttc aag cag att gcc tac aat cac ccc agc agc agt att ggc gtg 1699
Glu Phe Lys Gln Ile Ala Tyr Asn His Pro Ser Ser Ser Ile Gly Val
520 525 530

ttc ttc tgt gga cct aaa gct ctc tcg agg aca ctt caa aag atg tgc 1747
Phe Phe Cys Gly Pro Lys Ala Leu Ser Arg Thr Leu Gln Lys Met Cys
535 540 545

cac ttg tat tca tca gct gac ccc aga ggt gtt cat ttc tat tac aac 1795
His Leu Tyr Ser Ser Ala Asp Pro Arg Gly Val His Phe Tyr Tyr Asn
550 555 560

aag gag agc ttc tag actttggagg tcaagtccag gcattgtgtt ttcaatcaag 1850

Lys Glu Ser Phe

565

ttattgattc caaagaactc caccaggaat tcctgtgacg gcctgttgat atgagctccc 1910
agttgggaac tggtgaataa taattaacta ttgtgaacag tacactatac catacttcct 1970
tagcttataa ataacatgtc atatacaaca gaacaaaaac atttactgaa attaaaatat 2030
2044



<210> 4 
<211> 568 
<212> PRT 

<213> Homo sapiens 
<400> 4 

Met Met Gly Cys Trp Ile Leu Asn Glu Gly Leu Ser Thr Ile Leu Val
1 5 10 15

Leu Ser Trp Leu Gly Ile Asn Phe Tyr Leu Phe Ile Asp Thr Phe Tyr
20 25 30

Trp Tyr Glu Glu Glu Glu Ser Phe His Tyr Thr Arg Val Ile Leu Gly
35 40 45

Ser Thr Leu Ala Trp Ala Arg Ala Ser Ala Leu Cys Leu Asn Phe Asn
50 55 60

Cys Met Leu Ile Leu Ile Pro Val Ser Arg Asn Leu Ile Ser Phe Ile
65 70 75 80

Arg Gly Thr Ser Ile Cys Cys Arg Gly Pro Trp Arg Arg Gln Leu Asp
85 90 95

Lys Asn Leu Arg Phe His Lys Leu Val Ala Tyr Gly Ile Ala Val Asn
100 105 110

Ala Thr Ile His Ile Val Ala His Phe Phe Asn Leu Glu Arg Tyr His
115 120 125

Trp Ser Gln Ser Glu Glu Ala Gln Gly Leu Leu Ala Ala Leu Ser Lys
130 135 140

Leu Gly Asn Thr Pro Asn Glu Ser Tyr Leu Asn Pro Val Arg Thr Phe
145 150 155 160

Pro Thr Asn Thr Thr Thr Glu Leu Leu Arg Thr Ile Ala Gly Val Thr
165 170 175

Gly Leu Val Ile Ser Leu Ala Leu Val Leu Ile Met Thr Ser Ser Thr
180 185 190

Glu Phe Ile Arg Gln Ala Ser Tyr Glu Leu Phe Trp Tyr Thr His His
195 200 205

Val Phe Ile Val Phe Phe Leu Ser Leu Ala Ile His Gly Thr Gly Arg
210 215 220

Ile Val Arg Gly Gln Thr Gln Asp Ser Leu Ser Leu His Asn Ile Thr
225 230 235 240

Phe Cys Arg Asp Arg Tyr Ala Glu Trp Gln Thr Val Ala Gln Cys Pro
245 250 255

Val Pro Gln Phe Ser Gly Lys Glu Pro Ser Ala Trp Lys Trp Ile Leu
260 265 270

Gly Pro Val Val Leu Tyr Ala Cys Glu Arg Ile Ile Arg Phe Trp Arg
275 280 285

Phe Gln Gln Glu Val Val Ile Thr Lys Val Val Ser His Pro Ser Gly
290 295 300

Val Leu Glu Leu His Met Lys Lys Arg Gly Phe Lys Met Ala Pro Gly
305 310 315 320

Gln Tyr Ile Leu Val Gln Cys Pro Ala Ile Ser Ser Leu Glu Trp His
325 330 335

Pro Phe Thr Leu Thr Ser Ala Pro Gln Glu Asp Phe Phe Ser Val His
340 345 350

Ile Arg Ala Ala Gly Asp Trp Thr Ala Ala Leu Leu Glu Ala Phe Gly
355 360 365

Ala Glu Gly Gln Ala Leu Gln Glu Pro Trp Ser Leu Pro Arg Leu Ala
370 375 380

Val Asp Gly Pro Phe Gly Thr Ala Leu Thr Asp Val Phe His Tyr Pro
385 390 395 400

Val Cys Val Cys Val Ala Ala Gly Ile Gly Val Thr Pro Phe Ala Ala
405 410 415

Leu Leu Lys Ser Ile Trp Tyr Lys Cys Ser Glu Ala Gln Thr Pro Leu
420 425 430

Lys Leu Ser Lys Val Tyr Phe Tyr Trp Ile Cys Arg Asp Ala Arg Ala
435 440 445

Phe Glu Trp Phe Ala Asp Leu Leu Leu Ser Leu Glu Thr Arg Met Ser
450 455 460

Glu Gln Gly Lys Thr His Phe Leu Ser Tyr His Ile Phe Leu Thr Gly
465 470 475 480

Trp Asp Glu Asn Gln Ala Leu His Ile Ala Leu His Trp Asp Glu Asn
485 490 495

Thr Asp Val Ile Thr Gly Leu Lys Gln Lys Thr Phe Tyr Gly Arg Pro
500 505 510

Asn Trp Asn Asn Glu Phe Lys Gln Ile Ala Tyr Asn His Pro Ser Ser
515 520 525

Ser Ile Gly Val Phe Phe Cys Gly Pro Lys Ala Leu Ser Arg Thr Leu
530 535 540

Gln Lys Met Cys His Leu Tyr Ser Ser Ala Asp Pro Arg Gly Val His
545 550 555 560

Phe Tyr Tyr Asn Lys Glu Ser Phe 
565 



<210> 5 
<211> 21 
<212> DNA

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 5 

aacaagcgtg gcttcagcat g 21 

<210> 6 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 6 

agcaatattg ttggtcat 18 

<210> 7 
<211> 24
<212> DNA

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 7 

gacttgacag aaaatctata aggg 24 

<210> 8
<211> 20
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: primer
<400> 8

ttgtaccaga tggatttcaa 20



<210> 9
<211> 21
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: primer
<400> 9

caggtctgaa acagaaaacc t 21



<210> 10
<211> 27
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: primer
<400> 10

atgaattctc attaattatt caataaa 27



<210> 11 
<211> 20
<212> DNA

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 11 

atctcaaaag actctgcaca 20 



<210> 12 
<211> 569 
<212> PRT 

<213> Homo sapiens 
<400> 12 

Gly Asn Trp Ala Val Asn Glu Gly Leu Ser Ile Phe Ala Ile Leu Val
1 5 10 15

Trp Leu Gly Leu Asn Val Phe Leu Phe Val Trp Tyr Tyr Arg Val Tyr
20 25 30

Asp Ile Pro Pro Lys Phe Phe Tyr Thr Arg Lys Leu Leu Gly Ser Ala
35 40 45

Leu Ala Leu Ala Arg Ala Pro Ala Ala Cys Leu Asn Phe Asn Cys Met
50 55 60

Leu Ile Leu Leu Pro Val Cys Arg Asn Leu Leu Ser Phe Leu Arg Gly
65 70 75 80

Ser Ser Ala Cys Cys Ser Thr Arg Val Arg Arg Gln Leu Asp Arg Asn
85 90 95

Leu Thr Phe His Lys Met Val Ala Trp Met Ile Ala Leu His Ser Ala
100 105 110

Ile His Thr Ile Ala His Leu Phe Asn Val Glu Trp Cys Val Asn Ala
115 120 125

Arg Val Asn Asn Ser Asp Pro Tyr Ser Val Ala Leu Ser Glu Leu Gly
130 135 140

Asp Arg Gln Asn Glu Ser Tyr Leu Asn Phe Ala Arg Lys Arg Ile Lys
145 150 155 160

Asn Pro Glu Gly Gly Leu Tyr Leu Ala Val Thr Leu Leu Ala Gly Ile
165 170 175

Thr Gly Val Val Ile Thr Leu Cys Leu Ile Leu Ile Ile Thr Ser Ser
180 185 190

Thr Lys Thr Ile Arg Arg Ser Tyr Phe Glu Val Phe Trp Tyr Thr His
195 200 205

His Leu Phe Val Ile Phe Phe Ile Gly Leu Ala Ile His Gly Ala Glu
210 215 220

Arg Ile Val Arg Gly Gln Thr Ala Glu Ser Leu Ala Val His Asn Ile
225 230 235 240

Thr Val Cys Glu Gln Lys Ile Ser Glu Trp Gly Lys Ile Lys Glu Cys
245 250 255

Pro Ile Pro Gln Phe Ala Gly Asn Pro Pro Met Thr Trp Lys Trp Ile
260 265 270

Val Gly Pro Met Phe Leu Tyr Leu Cys Glu Arg Leu Val Arg Phe Trp
275 280 285

Arg Ser Gln Gln Lys Val Val Ile Thr Lys Val Val Thr His Pro Phe
290 295 300

Lys Thr Ile Glu Leu Gln Met Lys Lys Lys Gly Phe Lys Met Glu Val
305 310 315 320



Gly Gln Tyr Ile Phe Val Lys Cys Pro Lys Val Ser Lys Leu Glu Trp
325 330 335

His Pro Phe Thr Leu Thr Ser Ala Pro Glu Glu Asp Phe Phe Ser Ile
340 345 350

His Ile Arg Ile Val Gly Asp Trp Thr Glu Gly Leu Phe Asn Ala Cys
355 360 365

Gly Cys Asp Lys Gln Glu Phe Gln Asp Ala Trp Lys Leu Pro Lys Ile
370 375 380

Ala Val Asp Gly Pro Phe Gly Thr Ala Ser Glu Asp Val Phe Ser Tyr
385 390 395 400

Glu Val Val Met Leu Val Gly Ala Gly Ile Gly Val Thr Pro Phe Ala
405 410 415

Ser Ile Leu Lys Ser Val Trp Tyr Lys Tyr Cys Asn Asn Ala Thr Asn
420 425 430

Leu Lys Leu Lys Lys Ile Tyr Phe Tyr Trp Leu Cys Arg Asp Thr His
435 440 445

Ala Phe Glu Trp Phe Ala Asp Leu Leu Gln Leu Leu Glu Ser Gln Met
450 455 460

Gln Glu Arg Asn Asn Ala Gly Phe Leu Ser Tyr Asn Ile Tyr Leu Thr
465 470 475 480

Gly Trp Asp Glu Ser Gln Ala Asn His Phe Ala Val His His Asp Glu
485 490 495

Glu Lys Asp Val Ile Thr Gly Leu Lys Gln Lys Thr Leu Tyr Gly Arg
500 505 510

Pro Asn Trp Asp Asn Glu Phe Lys Thr Ile Ala Ser Gln His Pro Asn
515 520 525

Thr Arg Ile Gly Val Phe Leu Cys Gly Pro Glu Ala Leu Ala Glu Thr
530 535 540

Leu Ser Lys Gln Ser Ile Ser Asn Ser Glu Ser Gly Pro Arg Gly Val
545 550 555 560

His Phe Ile Phe Asn Lys Glu Asn Phe
565



<210> 13 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 13 

ttggctaaat cccatcca 18 



<210> 14 
<211> 21 
<212> DNA

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 

<400> 14

tgcatgacca acaatattgc t 21 
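Forward and reverse primers in a listing such as this one can be compared by taking the reverse complement of one and looking for it in the other. The sketch below is illustrative only: it compares the printed sequences of SEQ ID NO:6 and SEQ ID NO:14, and the relationship it reports is an observation about those printed strings, not a statement made in the specification. The helper name `reverse_complement` is an assumption introduced for this example.

```python
# Translation table for the four unambiguous DNA bases (lower case,
# as printed in the listing).
COMPLEMENT = str.maketrans("acgt", "tgca")


def reverse_complement(seq):
    """Return the reverse complement of a DNA string, ignoring spaces."""
    cleaned = seq.replace(" ", "").lower()
    return cleaned.translate(COMPLEMENT)[::-1]


seq_id_6 = "agcaatattg ttggtcat"       # as printed for SEQ ID NO:6
seq_id_14 = "tgcatgacca acaatattgc t"  # as printed for SEQ ID NO:14

rc_6 = reverse_complement(seq_id_6)
# True when SEQ ID NO:14 ends with the reverse complement of SEQ ID NO:6.
overlaps = seq_id_14.replace(" ", "").endswith(rc_6)
print(rc_6, overlaps)
```

The same helper can be applied to any other primer pair in the listing, provided the sequences contain only a, c, g and t; ambiguity codes such as n, r or s would need an extended table.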



<210> 15 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 15 

caaggtacct cttgaccatg ggaaact 27 



<210> 16 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 16 

acgaattcaa gtaaattact gaagata 27 



<210> 17 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<221> modified_base
<222> (3)

<223> n at position 3 = inosine
<220>

<221> modified_base
<222> (6)

<223> n at position 6 = inosine
<220>

<221> modified_base
<222> (12)

<223> n at position 12 = inosine
<220>

<223> Description of Artificial Sequence: primer
<400> 17

ccngtntgtc gnaatctgct stcctt 26



<210> 18 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<221> modified_base
<222> (5)

<223> n at position 5 = inosine
<220>

<221> modified_base
<222> (9)

<223> n at position 9 = inosine
<220>

<221> modified_base
<222> (11)

<223> n at position 11 = inosine
<220>

<223> Description of Artificial Sequence: primer
<400> 18

tcccngcana nccagtagaa rtagatctt 29
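The <223> annotations for the degenerate primers state which positions carry inosine, written as n in the sequence. A minimal sketch of how those annotations can be checked against the printed sequences follows; the function name `n_positions` is hypothetical, and positions are counted from 1 as in the listing.

```python
def n_positions(seq):
    """Return the 1-based positions of 'n' in a sequence, ignoring spaces."""
    cleaned = seq.replace(" ", "").lower()
    return [i for i, base in enumerate(cleaned, start=1) if base == "n"]


# Sequences as printed for SEQ ID NO:17 and SEQ ID NO:18.
primers = {
    "SEQ ID NO:17": "ccngtntgtc gnaatctgct stcctt",
    "SEQ ID NO:18": "tcccngcana nccagtagaa rtagatctt",
}

for label, seq in primers.items():
    print(label, len(seq.replace(" ", "")), n_positions(seq))
```

The reported positions can then be compared with the <222> and <223> entries, and the reported lengths with the <211> entries, for each primer.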

<210> 19 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 19 

ttggcacagt cagtgaggat gtct^c 

<210> 20
<211> 30
<212> DNA

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 20 

ctgttggctt ctactgtagc gttcaaagtt 30



<210> 21 
<211> 563 
<212> PRT 
<213> Rat 

<400> 21 

Met Gly Asn Trp Leu Val Asn His Trp Leu Ser Val Leu Phe Leu Val 
1 5 10 15 

Ser Trp Leu Gly Leu Asn Ile Phe Leu Phe Val Tyr Val Phe Leu Asn
20 25 30

Tyr Glu Lys Ser Asp Lys Tyr Tyr Tyr Thr Arg Glu Ile Leu Gly Thr
35 40 45

Ala Leu Ala Leu Ala Arg Ala Ser Ala Leu Cys Leu Asn Phe Asn Ser
50 55 60

Met Val Ile Leu Ile Pro Val Cys Arg Asn Leu Leu Ser Phe Leu Arg
65 70 75 80

Gly Thr Cys Ser Phe Cys Asn His Thr Leu Arg Lys Pro Leu Asp His
85 90 95

Asn Leu Thr Phe His Lys Leu Val Ala Tyr Met Ile Cys Ile Phe Thr
100 105 110

Ala Ile His Ile Ile Ala His Leu Phe Asn Phe Glu Arg Tyr Ser Arg
115 120 125

Ser Gln Gln Ala Met Asp Gly Ser Leu Ala Ser Val Leu Ser Ser Leu
130 135 140

Phe His Pro Glu Lys Glu Asp Ser Trp Leu Asn Pro Ile Gln Ser Pro
145 150 155 160

Asn Val Thr Val Met Tyr Ala Ala Phe Thr Ser Ile Ala Gly Leu Thr
165 170 175

Gly Val Val Ala Thr Val Ala Leu Val Leu Met Val Thr Ser Ala Met
180 185 190

Glu Phe Ile Arg Arg Asn Tyr Phe Glu Leu Phe Trp Tyr Thr His His
195 200 205

Leu Phe Ile Ile Tyr Ile Ile Cys Leu Gly Ile His Gly Leu Gly Gly
210 215 220

Ile Val Arg Gly Gln Thr Glu Glu Ser Met Ser Glu Ser His Pro Arg
225 230 235 240

Asn Cys Ser Tyr Ser Phe His Glu Trp Asp Lys Tyr Glu Arg Ser Cys
245 250 255

Arg Ser Pro His Phe Val Gly Gln Pro Pro Glu Ser Trp Lys Trp Ile
260 265 270

Leu Ala Pro Ile Ala Phe Tyr Ile Phe Glu Arg Ile Leu Arg Phe Tyr
275 280 285

Arg Ser Arg Gln Lys Val Val Ile Thr Lys Val Val Met His Pro Cys
290 295 300

Lys Val Leu Glu Leu Gln Met Arg Lys Arg Gly Phe Thr Met Gly Ile
305 310 315 320

Gly Gln Tyr Ile Phe Val Asn Cys Pro Ser Ile Ser Phe Leu Glu Trp
325 330 335

His Pro Phe Thr Leu Thr Ser Ala Pro Glu Glu Glu Phe Phe Ser Ile
340 345 350

His Ile Arg Ala Ala Gly Asp Trp Thr Glu Asn Leu Ile Arg Thr Phe
355 360 365

Glu Gln Gln His Ser Pro Met Pro Arg Ile Glu Val Asp Gly Pro Phe
370 375 380

Gly Thr Val Ser Glu Asp Val Phe Gln Tyr Glu Val Ala Val Leu Val
385 390 395 400

Gly Ala Gly He Gly Val Thr Pro Phe Ala Ser Phe Leu Lys Ser He 
405 410 415 

Trp Tyr Lys Phe Gin Arg Ala His Asn Lys Leu Lys Thr Gin Lys He 
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Tyr Phe Tyr Trp lie Cys Arg Glu Thr Gly Ala Phe Ala Trp Phe Asn 
435 440 

Asn Leu Leu Asn Ser Leu Glu Gln Glu Met Asp Glu Leu Gly Lys Pro
450 455 460

Asp Phe Leu Asn Tyr Arg Leu Phe Leu Thr Gly Trp Asp Ser Asn Ile
465 470 475 480

Ala Gly His Ala Ala Leu Asn Phe Asp Arg Ala Thr Asp Val Leu Thr
485 490 495

Gly Leu Lys Gln Lys Thr Ser Phe Gly Arg Pro Met Trp Asp Asn Glu
500 505 510

Phe Ser Arg Ile Ala Thr Ala His Pro Lys Ser Val Val Gly Val Phe
515 520 525

Leu Cys Gly Pro Pro Thr Leu Ala Lys Ser Leu Arg Lys Cys Cys Arg
530 535 540

Arg Tyr Ser Ser Leu Asp Pro Arg Lys Val Gln Phe Tyr Phe Asn Lys
545 550 555 560 

Glu Thr Phe 



<210> 22 
<211> 2577 
<212> DNA
<213> Rat 

<400> 22 

ttctgagtag gtgtgcattt gagtgtcata aagacatata tcttgagcta gacagaagtt 60

cctatcctga aggatcccat cagagaaacc agattgctcc taagaggctc cagacctcca 120

tttgacaatg ggaaactggc tggttaacca ctggctctca gttttgtttc tggtttcttg 180

gttggggctg aacatttttc tgtttgtgta cgtcttcctg aattatgaga agtctgacaa 240

gtactattac acgagagaaa ttctcggaac tgccttggcc ttggccagag catctgcttt 300

gtgcctgaat tttaacagca tggtgatcct gattcctgtg tgtcgaaatc tgctctcctt 360



cctgaggggc acctgctcat tttgcaacca cacgctgaga aagccattgg atcacaacct 420 

caccttccat aagctggtgg catatatgat ctgcatattc acagctattc atatcattgc 480 

acatctattt aactttgaac gctacagtag aagccaacag gccatggatg gatctcttgc 540 

ctctgttctc tccagcctat tccatcccga gaaagaagat tcttggctaa atcccatcca 600 

gtctccaaac gtgacagtga tgtatgcagc atttaccagt attgctggcc ttactggagt 660 

ggtcgccact gtggctttgg ttctcatggt aacttcagct atggagttta tccgcaggaa 720 

ttattttgag ctcttctggt atacacatca ccttttcatc atctatatca tctgcttagg 780 

gatccatggc ctggggggga ttgtccgggg tcaaacagaa gagagcatga gtgaaagtca 840 

tccccgcaac tgttcatact ctttccacga gtgggataag tatgaaagga gttgcaggag 900 

tcctcatttt gtggggcaac cccctgagtc ttggaagtgg atcctcgcgc cgattgcttt 960 

ttatatcttt gaaaggatcc ttcgctttta tcgctcccgg cagaaggtcg tgattaccaa 1020 

ggttgtcatg cacccatgta aagttttgga attgcagatg aggaagcggg gctttactat 1080 

gggaatagga cagtatatat tcgtaaattg cccctcgatt tccttcctgg aatggcatcc 1140 

ctttactctg acctctgctc cagaggaaga atttttctcc attcatattc gagcagcagg 1200 

ggactggaca gaaaatctca taaggacatt tgaacaacag cactcaccaa tgcccaggat 1260

cgaggtggat ggtccctttg gcacagtcag tgaggatgtc ttccagtacg aagtggctgt 1320 

actggttggg gcagggattg gcgtcactcc ctttgcttcc ttcttgaaat ctatctggta 1380 

caaattccag cgtgcacaca acaagctgaa aacacaaaag atctatttct actggatttg 1440 

tagagagacg ggtgcctttg cctggttcaa caacttattg aattccctgg aacaagagat 1500 

ggacgaatta ggcaaaccgg atttcctaaa ctaccgactc ttcctcactg gctgggatag 1560 

caacattgct ggtcatgcag cattaaactt tgacagagcc actgacgtcc tgacaggtct 1620

gaaacagaaa acctcctttg ggagaccaat gtgggacaat gagttttcta gaatagctac 1680 

tgcccacccc aagtctgtgg tgggggtttt cttatgcggc cctccgactt tggcaaaaag 1740 

cctgcgcaaa tgctgtcggc ggtactcaag tctggatcct aggaaggttc aattctactt 1800 
caacaaagaa acgttctgaa ttggaggaag ccgcacagta gtacttctcc atcttccttt 1860

tcactaacgt gtgggtcagc tactagatag tccgttgtcg cacaaggact tcactcccat 1920 

cttaaagttg actcaactcc atcattcttg ggctttggca acatgagagc tgcataactc 1980 

acaattgcaa aacacatgaa ttattattgg ggggattgta aatccttctg ggaaacctgc 2040 

ctttagctga atcttgctgg ttgacacttg cacaatttaa cctcaggtgt cttggttgat 2100 

acctgataat cttccctccc acctgtccct cacagaagat ttctaagtag ggtgatttta 2160 

aaatatttat tgaatccacg acaaaacaat aatcataaat aataaacata aaattaccaa 2220 

gattcccact cccatatcat acccactaag aacatcgtta tacatgagct tatcatccag 2280 

tgtgaccaac aatttatact ttactgtgcc aaaataatct tcatctttgc ttattgaaca 2340 

attttgctga ctttccctag taatatctta agtatattaa ctggaatcaa atttgtatta 2400 

tagttagaag ccaactatat tgccagtttg tattgtttga aataactgga aaggcctgac 2460 

ctacatcgtg gggtaattta acagaagctc tttccatttt ttgttgttgt tgttaaagag 2520 

ttttgtttat gaatgtgtta taaaaagaaa ataaaaagtt ataattttga cggaaaa 2577 

<210> 23 
<211> 332 
<212> PRT 

<213> Homo sapiens 
<400> 23 

Glu Ser Met Asn Glu Ser His Pro Arg Lys Cys Ala Glu Ser Phe Glu 
1 5 10 15 

Met Trp Asp Asp Arg Asp Ser His Cys Arg Arg Pro Lys Phe Glu Gly 
20 25 30 

His Pro Pro Glu Ser Trp Lys Trp Ile Leu Ala Pro Val Ile Leu Tyr
35 40 45

Ile Cys Glu Arg Ile Leu Arg Phe Tyr Arg Ser Gln Gln Lys Val Val
50 55 60

Ile Thr Lys Val Val Met His Pro Ser Lys Val Leu Glu Leu Gln Met
65 70 75 80 
Asn Lys Arg Gly Phe Ser Met Glu Val Gly Gln Tyr Ile Phe Val Asn
85 90 95 

Cys Pro Ser Ile Ser Leu Leu Glu Trp His Pro Phe Thr Leu Thr Ser
100 105 110

Ala Pro Glu Glu Asp Phe Phe Ser Ile His Ile Arg Ala Ala Gly Asp
115 120 125

Trp Thr Glu Asn Leu Ile Arg Ala Phe Glu Gln Gln Tyr Ser Pro Ile
130 135 140

Pro Arg Ile Glu Val Asp Gly Pro Phe Gly Thr Ala Ser Glu Asp Val
145 150 155 160

Phe Gln Tyr Glu Val Ala Val Leu Val Gly Ala Gly Ile Gly Val Thr
165 170 175

Pro Phe Ala Ser Ile Leu Lys Ser Ile Trp Tyr Lys Phe Gln Cys Ala
180 185 190

Asp His Asn Leu Lys Thr Lys Lys Ile Tyr Phe Tyr Trp Ile Cys Arg
195 200 205

Glu Thr Gly Ala Phe Ser Trp Phe Asn Asn Leu Leu Thr Ser Leu Glu
210 215 220

Gln Glu Met Glu Glu Leu Gly Lys Val Gly Phe Leu Asn Tyr Arg Leu
225 230 235 240

Phe Leu Thr Gly Trp Asp Ser Asn Ile Val Gly His Ala Ala Leu Asn
245 250 255

Phe Asp Lys Ala Thr Asp Ile Val Thr Gly Leu Lys Gln Lys Thr Ser
260 265 270

Phe Gly Arg Pro Met Trp Asp Asn Glu Phe Ser Thr Ile Ala Thr Ser
275 280 285

His Pro Lys Ser Val Val Gly Val Phe Leu Cys Gly Pro Arg Thr Leu
290 295 300

Ala Lys Ser Leu Arg Lys Cys Cys His Arg Tyr Ser Ser Leu Asp Pro
305 310 315 320

Arg Lys Val Gln Phe Tyr Phe Asn Lys Glu Asn Phe
325 330 
<210> 24

<211> 14 

<212> PRT 

<213> Homo sapiens 

<400> 24 

Cys Ala Glu Ser Phe Glu Met Trp Asp Asp Arg Asp Ser His
1 5 10



<210> 25 
<211> 14 
<212> PRT 

<213> Homo sapiens 
<400> 25 

Lys Ser Leu Arg Lys Cys Cys His Arg Tyr Ser Ser Leu Asp
1 5 10



<210> 26 
<211> 24 
<212> DNA

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 26 

gaagtggtgg gaggcgaaga cata 24



<210> 27 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 

<400> 27 

cctgtcatac ctgggacggt ctgg 24 



<210> 28 
<211> 24 
<212> DNA 
<213> Artificial Sequence

<220> 

<223> Description of Artificial Sequence: primer 
<400> 28 

gagcacagtg agatgcctgt tcag 

<210> 29 
<211> 24 
<212> DNA

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 29 

ggaaggcagc agagagcaat gatg 

<210> 30 
<211> 24 
<212> DNA

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 30 

acatctgcga gcggcacttc caga 

<210> 31 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 31 

agctcgtcaa caggcaggac cgagc 

<210> 32 
<211> 24 
<212> DNA



<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: primer 
<400> 32 

gcagtgcatc cacatcttca gcac 

<210> 33 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 33 

gagagctctg gagacacttg agttc 

<210> 34 
<211> 22
<212> DNA 

<213> Artificial Sequence
<220> 

<223> Description of Artificial Sequence: primer 
<400> 34 

catgttctct ctggctgaca ag 

<210> 35 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 35 

cacaatagcg agctccgctt cacgc 

<210> 36 
<211> 24 
<212> DNA 
<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: primer 
<400> 36 

gcaggacatc aaccctgcac tctc 



<210> 37 
<211> 570 
<212> PRT 
<213> Bovine 

<400> 37 

Met Gly Asn Trp Val Val Asn Glu Gly Ile Ser Ile Phe Val Ile Leu
1 5 10 15

Val Trp Leu Gly Met Asn Val Phe Leu Phe Val Trp Tyr Tyr Arg Val
20 25 30

Tyr Asp Ile Pro Asp Lys Phe Phe Tyr Thr Arg Lys Leu Leu Gly Ser
35 40 45

Ala Leu Ala Leu Ala Arg Ala Pro Ala Ala Cys Leu Asn Phe Asn Cys
50 55 60

Met Leu Ile Leu Leu Pro Val Cys Arg Asn Leu Leu Ser Phe Leu Arg
65 70 75 80

Gly Ser Ser Ala Cys Cys Ser Thr Arg Ile Arg Arg Gln Leu Asp Arg
85 90 95

Asn Leu Thr Phe His Lys Met Val Ala Trp Met Ile Ala Leu His Thr
100 105 110

Ala Ile His Thr Ile Ala His Leu Phe Asn Val Glu Trp Cys Val Asn
115 120 125

Ala Arg Val Asn Asn Ser Asp Pro Tyr Ser Ile Ala Leu Ser Asp Ile
130 135 140

Gly Asp Lys Pro Asn Glu Thr Tyr Leu Asn Phe Val Arg Gln Arg Ile
145 150 155 160

Lys Asn Pro Glu Gly Gly Leu Tyr Val Ala Val Thr Arg Leu Ala Gly
165 170 175 
Ile Thr Gly Val Val Ile Thr Leu Cys Leu Ile Leu Ile Ile Thr Ser
180 185 190 

Ser Thr Lys Thr Ile Arg Arg Ser Tyr Phe Glu Val Phe Trp Tyr Thr
195 200 205

His His Leu Phe Val Ile Phe Phe Ile Gly Leu Ala Ile His Gly Ala
210 215 220

Gln Arg Ile Val Arg Gly Gln Thr Ala Glu Ser Leu Leu Lys His Gln
225 230 235 240

Pro Arg Asn Cys Tyr Gln Asn Ile Ser Gln Trp Gly Lys Ile Glu Asn
245 250 255

Cys Pro Ile Pro Glu Phe Ser Gly Asn Pro Pro Met Thr Trp Lys Trp
260 265 270

Ile Val Gly Pro Met Phe Leu Tyr Leu Cys Glu Arg Leu Val Arg Phe
275 280 285

Trp Arg Ser Gln Gln Lys Val Val Ile Thr Lys Val Val Thr His Pro
290 295 300

Phe Lys Thr Ile Glu Leu Gln Met Lys Lys Lys Gly Phe Lys Met Glu
305 310 315 320

Val Gly Gln Tyr Ile Phe Val Lys Cys Pro Val Val Ser Lys Leu Glu
325 330 335

Trp His Pro Phe Thr Leu Thr Ser Ala Pro Glu Glu Asp Phe Phe Ser
340 345 350

Ile His Ile Arg Ile Val Gly Asp Trp Thr Glu Gly Leu Phe Lys Ala
355 360 365

Cys Gly Cys Asp Lys Gln Glu Phe Gln Asp Ala Trp Lys Leu Pro Lys
370 375 380

Ile Ala Val Asp Gly Pro Phe Gly Thr Ala Ser Glu Asp Val Phe Ser
385 390 395 400

Tyr Glu Val Val Met Leu Val Gly Ala Gly Ile Gly Val Thr Pro Phe
405 410 415

Ala Ser Ile Leu Lys Ser Val Trp Tyr Lys Tyr Cys Asn Lys Ala Pro
420 425 430
Asn Leu Arg Leu Lys Lys Ile Tyr Phe Tyr Trp Leu Cys Arg Asp Thr
435 440 445

His Ala Phe Glu Trp Phe Ala Asp Leu Leu Gln Leu Leu Glu Thr Gln
450 455 460

Met Gln Glu Lys Asn Asn Thr Asp Phe Leu Ser Tyr Asn Ile Cys Leu
465 470 475 480

Thr Gly Trp Asp Glu Ser Gln Ala Ser His Phe Ala Met His His Asp
485 490 495

Glu Glu Lys Asp Val Ile Thr Gly Leu Lys Gln Lys Thr Leu Tyr Gly
500 505 510

Arg Pro Asn Trp Asp Asn Glu Phe Lys Thr Ile Gly Ser Gln His Pro
515 520 525

Asn Thr Arg Ile Gly Val Phe Leu Cys Gly Pro Glu Ala Leu Ala Asp
530 535 540

Thr Leu Asn Lys Gln Cys Ile Ser Asn Ser Asp Ser Gly Pro Arg Gly
545 550 555 560

Val His Phe Ile Phe Asn Lys Glu Asn Phe
565 570



<210> 38 
<211> 570 
<212> PRT 
<213> murine 

<400> 38 

Met Gly Asn Trp Ala Val Asn Glu Gly Leu Ser Ile Phe Val Ile Leu
1 5 10 15

Val Trp Leu Gly Leu Asn Val Phe Leu Phe Ile Asn Tyr Tyr Lys Val
20 25 30

Gly Pro Lys Tyr Asn Tyr Thr Arg Lys Leu Leu Gly Ser
40 45

Leu Ala Arg Ala Pro Ala Ala Cys Leu Asn Phe Asn Cys
55 60

Met Leu Ile Leu Leu Pro Val Cys Arg Asn Leu Leu Ser Phe Leu Arg
65 70 75 80 
Gly Ser Ser Ala Cys Cys Ser Thr Arg Ile Arg Arg Gln Leu Asp Arg
85 90 95

Asn Leu Thr Phe His Lys Met Val Ala Trp Met Ile Ala Leu His Thr
100 105 110

Ala Ile His Thr Ile Ala His Leu Phe Asn Val Glu Trp Cys Val Asn
115 120 125

Ala Arg Val Gly Ile Ser Asp Arg Tyr Ser Ile Ala Leu Ser Asp Ile
130 135 140

Gly Asp Asn Glu Asn Glu Glu Tyr Leu Asn Phe Ala Arg Glu Lys Ile
145 150 155 160

Lys Asn Pro Glu Gly Gly Leu Tyr Val Ala Val Thr Arg Leu Ala Gly
165 170 175

Ile Thr Gly Ile Val Ile Thr Leu Cys Leu Ile Leu Ile Ile Thr Ser
180 185 190

Ser Thr Lys Thr Ile Arg Arg Ser Tyr Phe Glu Val Phe Trp Tyr Thr
195 200 205

His His Leu Phe Val Ile Phe Phe Ile Gly Leu Ala Ile His Gly Ala
210 215 220

Glu Arg Ile Val Arg Gly Gln Thr Ala Glu Ser Leu Glu Glu His Asn
225 230 235 240

Leu Asp Ile Cys Ala Asp Lys Ile Glu Glu Trp Gly Lys Ile Lys Glu
245 250 255

Cys Pro Val Pro Lys Phe Ala Gly Asn Pro Pro Met Thr Trp Lys Trp
260 265 270

Ile Val Gly Pro Met Phe Leu Tyr Leu Cys Glu Arg Leu Val Arg Phe
275 280 285

Trp Arg Ser Gln Gln Lys Val Val Ile Thr Lys Val Val Thr His Pro
290 295 300

Phe Lys Thr Ile Glu Leu Gln Met Lys Lys Lys Gly Phe Lys Met Glu
305 310 315 320

Val Gly Gln Tyr Ile Phe Val Lys Cys Pro Lys Val Ser Lys Leu Glu
325 330 335 
Trp His Pro Phe Thr Leu Thr Ser Ala Pro Glu Glu Asp Phe Phe Ser
340 345 350

Ile His Ile Arg Ile Val Gly Asp Trp Thr Glu Gly Leu Phe Asn Ala
355 360 365

Cys Gly Cys Asp Lys Gln Glu Phe Gln Asp Ala Trp Lys Leu Pro Lys
370 375 380

Ile Ala Val Asp Gly Pro Phe Gly Thr Ala Ser Glu Asp Val Phe Ser
385 390 395 400

Tyr Glu Val Val Met Leu Val Gly Ala Gly Ile Gly Val Thr Pro Phe
405 410 415

Ala Ser Ile Leu Lys Ser Val Trp Tyr Lys Tyr Cys Asp Asn Ala Thr
420 425 430

Ser Leu Lys Leu Lys Lys Ile Tyr Phe Tyr Trp Leu Cys Arg Asp Thr
435 440 445

His Ala Phe Glu Trp Phe Ala Asp Leu Leu Gln Leu Leu Glu Thr Gln
450 455 460

Met Gln Glu Arg Asn Asn Ala Asn Phe Leu Ser Tyr Asn Ile Tyr Leu
465 470 475 480

Thr Gly Trp Asp Glu Ser Gln Ala Asn His Phe Ala Val His His Asp
485 490 495

Glu Glu Lys Asp Val Ile Thr Gly Leu Lys Gln Lys Thr Leu Tyr Gly
500 505 510

Arg Pro Asn Trp Asp Asn Glu Phe Lys Thr Ile Ala Ser Glu His Pro
515 520 525

Asn Thr Thr Ile Gly Val Phe Leu Cys Gly Pro Glu Ala Leu Ala Glu
530 535 540

Thr Leu Ser Lys Gln Ser Ile Ser Asn Ser Glu Ser Gly Pro Arg Gly
545 550 555 560

Val His Phe Ile Phe Asn Lys Glu Asn Phe
565 570 



<210> 39 
<211> 944
<212> PRT 

<213> Arabidopsis sp.



<400> 39 

Met Lys Pro Phe Ser Lys Asn Asp Arg Arg Arg Trp Ser Phe Asp Ser 
1 5 10 15

Val Ser Ala Gly Lys Thr Ala Val Gly Ser Ala Ser Thr Ser Pro Gly
20 25 30

Thr Glu Tyr Ser Ile Asn Gly Asp Gln Glu Phe Val Glu Val Thr Ile
35 40 45

Asp Leu Gln Asp Asp Asp Thr Ile Val Leu Arg Ser Val Glu Pro Ala
50 55 60

Thr Ala Ile Asn Val Ile Gly Asp Ile Ser Asp Asp Asn Thr Gly Ile
65 70 75 80

Met Thr Pro Val Ser Ile Ser Arg Ser Pro Thr Met Lys Arg Thr Ser
85 90 95

Ser Asn Arg Phe Arg Gln Phe Ser Gln Glu Leu Lys Ala Glu Ala Val
100 105 110

Ala Lys Ala Lys Gln Leu Ser Gln Glu Leu Lys Arg Phe Ser Trp Ser
115 120 125

Arg Ser Phe Ser Gly Asn Leu Thr Thr Thr Ser Thr Ala Ala Asn Gln
130 135 140

Ser Gly Gly Ala Gly Gly Gly Leu Val Asn Ser Ala Leu Glu Ala Arg
145 150 155 160

Ala Leu Arg Lys Gln Arg Ala Gln Leu Asp Arg Thr Arg Ser Ser Ala
165 170 175

Gln Arg Ala Leu Arg Gly Leu Arg Phe Ile Ser Asn Lys Gln Lys Asn
180 185 190

Val Asp Gly Trp Asn Asp Val Gln Ser Asn Phe Glu Lys Phe Glu Lys
195 200 205

Asn Gly Tyr Ile Tyr Arg Ser Asp Phe Ala Gln Cys Ile Gly Met Lys
210 215 220

Asp Ser Lys Glu Phe Ala Leu Glu Leu Phe Asp Ala Leu Ser Arg Arg
225 230 235 240



Arg Arg Leu Lys Val Glu Lys Ile Asn His Asp Glu Leu Tyr Glu Tyr
245 250 255

Trp Ser Gln Ile Asn Asp Glu Ser Phe Asp Ser Arg Leu Gln Ile Phe
260 265 270

Phe Asp Ile Val Asp Lys Asn Glu Asp Gly Arg Ile Thr Glu Glu Glu
275 280 285

Val Lys Glu Ile Ile Met Leu Ser Ala Ser Ala Asn Lys Leu Ser Arg
290 295 300

Leu Lys Glu Gln Ala Glu Glu Tyr Ala Ala Leu Ile Met Glu Glu Leu
305 310 315 320

Asp Pro Glu Arg Leu Gly Tyr Ile Glu Leu Trp Gln Leu Glu Thr Leu
325 330 335

Leu Leu Gln Lys Asp Thr Tyr Leu Asn Tyr Ser Gln Ala Leu Ser Tyr
340 345 350

Thr Ser Gln Ala Leu Ser Gln Asn Leu Gln Gly Leu Arg Gly Lys Ser
355 360 365

Arg Ile His Arg Met Ser Ser Asp Phe Val Tyr Ile Met Gln Glu Asn
370 375 380

Trp Lys Arg Ile Trp Val Leu Ser Leu Trp Ile Met Ile Met Ile Gly
385 390 395 400

Leu Phe Leu Trp Lys Phe Phe Gln Tyr Lys Gln Lys Asp Ala Phe His
405 410 415

Val Met Gly Tyr Cys Leu Leu Thr Ala Lys Gly Ala Ala Glu Thr Leu
420 425 430

Lys Phe Asn Met Ala Leu Ile Leu Phe Pro Val Cys Arg Asn Thr Ile
435 440 445

Thr Trp Leu Arg Ser Thr Arg Leu Ser Tyr Phe Val Pro Phe Asp Asp
450 455 460

Asn Ile Asn Phe His Lys Thr Ile Ala Gly Ala Ile Val Val Ala Val
465 470 475 480

Ile Leu His Ile Gly Asp His Leu Ala Cys Asp Phe Pro Arg Ile Val
485 490 495



Arg Ala Thr Glu Tyr Asp Tyr Asn Arg Tyr Leu Phe His Tyr Phe Gln
500 505 510

Thr Lys Gln Pro Thr Tyr Phe Asp Leu Val Lys Gly Pro Glu Gly Ile
515 520 525

Thr Gly Ile Leu Met Val Ile Leu Met Ile Ile Ser Phe Thr Leu Ala
530 535 540

Thr Arg Trp Phe Arg Arg Asn Leu Val Lys Leu Pro Lys Pro Phe Asp
545 550 555 560

Arg Leu Thr Gly Phe Asn Ala Phe Trp Tyr Ser His His Leu Phe Val
565 570 575

Ile Val Tyr Ile Leu Leu Ile Leu His Gly Ile Phe Leu Tyr Phe Ala
580 585 590

Lys Pro Trp Tyr Val Arg Thr Thr Trp Met Tyr Leu Ala Val Pro Val
595 600 605

Leu Leu Tyr Gly Gly Glu Arg Thr Leu Arg Tyr Phe Arg Ser Gly Ser
610 615 620

Tyr Ser Val Arg Leu Leu Lys Val Ala Ile Tyr Pro Gly Asn Val Leu
625 630 635 640

Thr Leu Gln Met Ser Lys Pro Thr Gln Phe Arg Tyr Lys Ser Gly Gln
645 650 655

Tyr Met Phe Val Gln Cys Pro Ala Val Ser Pro Phe Glu Trp His Pro
660 665 670

Phe Ser Ile Thr Ser Ala Pro Glu Asp Asp Tyr Ile Ser Ile His Ile
675 680 685

Arg Gln Leu Gly Asp Trp Thr Gln Glu Leu Lys Arg Val Phe Ser Glu
690 695 700

Val Cys Glu Pro Pro Val Gly Gly Lys Ser Gly Leu Leu Arg Ala Asp
705 710 715 720

Glu Thr Thr Lys Lys Ser Leu Pro Lys Leu Leu Ile Asp Gly Pro Tyr
725 730 735

Gly Ala Pro Ala Gln Asp Tyr Arg Lys Tyr Asp Val Leu Leu Leu Val
740 745 750

Gly Leu Gly Ile Gly Ala Thr Pro Phe Ile Ser Ile Leu Lys Asp Leu
755 760 765

Leu Asn Asn Ile Val Lys Met Glu Glu His Ala Asp Ser Ile Ser Asp
770 775 780

Phe Ser Arg Ser Ser Glu Tyr Ser Thr Gly Ser Asn Gly Asp Thr Pro
785 790 795 800

Arg Arg Lys Arg Ile Leu Lys Thr Thr Asn Ala Tyr Phe Tyr Trp Val
805 810 815

Thr Arg Glu Gln Gly Ser Phe Asp Trp Phe Lys Gly Val Met Asn Glu
820 825 830

Val Ala Glu Leu Asp Gln Arg Gly Val Ile Glu Met His Asn Tyr Leu
835 840 845

Thr Ser Val Tyr Glu Glu Gly Asp Ala Arg Ser Ala Leu Ile Thr Met
850 855 860

Val Gln Ala Leu Asn His Ala Lys Asn Gly Val Asp Ile Val Ser Gly
865 870 875 880

Thr Arg Val Arg Thr His Phe Ala Arg Pro Asn Trp Lys Lys Val Leu
885 890 895

Thr Lys Leu Ser Ser Lys His Cys Asn Ala Arg Ile Gly Val Phe Tyr
900 905 910

Cys Gly Val Pro Val Leu Gly Lys Glu Leu Ser Lys Leu Cys Asn Thr
915 920 925

Phe Asn Gln Lys Gly Ser Thr Lys Phe Glu Phe His Lys Glu His Phe
930 935 940 



<210> 40 
<211> 590 
<212> PRT 
<213> Rice 

<400> 40 
Asn Leu Ala Gly Leu Arg Lys Lys Ser Ser Ile Arg Lys Ile Ser Thr
1 5 10 15

Ser Leu Ser Tyr Tyr Phe Glu Asp Asn Trp Lys Arg Leu Trp Val Leu
20 25 30

Ala Leu Trp Ile Gly Ile Met Ala Gly Leu Phe Thr Trp Lys Phe Met
35 40 45

Gln Tyr Arg Asn Arg Tyr Val Phe Asp Val Met Gly Tyr Cys Val Thr
50 55 60

Thr Ala Lys Gly Ala Ala Glu Thr Leu Lys Leu Asn Met Ala Ile Ile
65 70 75 80

Leu Leu Pro Val Cys Arg Asn Thr Ile Thr Trp Leu Arg Ser Thr Arg
85 90 95

Ala Ala Arg Ala Leu Pro Phe Asp Asp Asn Ile Asn Phe His Lys Thr
100 105 110

Ile Ala Ala Ala Ile Val Val Gly Ile Ile Leu His Ala Gly Asn His
115 120 125

Leu Val Cys Asp Phe Pro Arg Leu Ile Lys Ser Ser Asp Glu Lys Tyr
130 135 140

Ala Pro Leu Gly Gln Tyr Phe Gly Glu Ile Lys Pro Thr Tyr Phe Thr
145 150 155 160

Leu Val Lys Gly Val Glu Gly Ile Thr Gly Val Ile Met Val Val Cys
165 170 175

Met Ile Ile Ala Phe Thr Leu Ala Thr Arg Trp Phe Arg Arg Ser Leu
180 185 190

Val Lys Leu Pro Arg Pro Phe Asp Lys Leu Thr Gly Phe Asn Ala Phe
195 200 205

Trp Tyr Ser His His Leu Phe Ile Ile Val Tyr Ile Ala Leu Ile Val
210 215 220

His Gly Glu Cys Leu Tyr Leu Ile His Val Trp Tyr Arg Arg Thr Thr
225 230 235 240

Trp Met Tyr Leu Ser Val Pro Val Cys Leu Tyr Val Gly Glu Arg Ile
245 250 255 
Leu Arg Phe Phe Arg Ser Gly Ser Tyr Ser Val Arg Leu Leu Lys Val
260 265 270

Ala Ile Tyr Pro Gly Asn Val Leu Thr Leu Gln Met Ser Lys Pro Pro
275 280 285

Thr Phe Arg Tyr Lys Ser Gly Gln Tyr Met Phe Val Gln Cys Pro Ala
290 295 300

Val Ser Pro Phe Glu Trp His Pro Phe Ser Ile Thr Ser Ala Pro Gly
305 310 315 320

Asp Asp Tyr Leu Ser Ile His Val Arg Gln Leu Gly Asp Trp Thr Arg
325 330 335

Glu Leu Lys Arg Val Phe Ala Ala Ala Cys Glu Pro Pro Ala Gly Gly
340 345 350

Lys Ser Gly Leu Leu Arg Ala Asp Glu Thr Thr Lys Lys Ile Leu Pro
355 360 365

Lys Leu Leu Ile Asp Gly Pro Tyr Gly Ser Pro Ala Gln Asp Tyr Ser
370 375 380

Lys Tyr Asp Val Leu Leu Leu Val Gly Leu Gly Ile Gly Ala Thr Pro
385 390 395 400

Phe Ile Ser Ile Leu Lys Asp Leu Leu Asn Asn Ile Ile Lys Met Glu
405 410 415

Glu Glu Glu Asp Ala Ser Thr Asp Leu Tyr Pro Pro Met Gly Arg Asn
420 425 430

Asn Pro His Val Asp Leu Gly Thr Leu Met Thr Ile Thr Ser Arg Pro
435 440 445

Lys Lys Ile Leu Lys Thr Thr Asn Ala Tyr Phe Tyr Trp Val Thr Arg
450 455 460

Glu Gln Gly Ser Phe Asp Trp Phe Lys Gly Val Met Asn Glu Ile Ala
465 470 475 480

Asp Leu Asp Gln Arg Asn Ile Ile Glu Met His Asn Tyr Leu Thr Ser
485 490 495

Val Tyr Glu Glu Gly Asp Ala Arg Ser Ala Leu Ile Thr Met Leu Gln
500 505 510
Ala Leu Asn His Ala Lys Asn Gly Val Asp Ile Val Ser Gly Thr Lys
515 520 525

Val Arg Thr His Phe Ala Arg Pro Asn Trp Arg Lys Val Leu Ser Lys
530 535 540

Ile Ser Ser Lys His Pro Tyr Ala Lys Ile Gly Val Phe Tyr Cys Gly
545 550 555 560

Ala Pro Val Leu Ala Gln Glu Leu Ser Lys Leu Cys His Glu Phe Asn
565 570 575

Gly Lys Cys Thr Thr Lys Phe Asp Phe His Lys Glu His Phe
580 585 590

<210> 41 
<211> 2619 
<212> DNA
<213> Rat 

<400> 41 

gtgctgtcag agctttacag agcctctggg catgcgcatg gctacccatt tcattgattt 60 
acagaagtca tgctaaaatc tctttcatgc atgtcttcct ttttcagtct ctcctttccc 120 
aaagcttttc agtttgccct ttgcttgtac caactgctat ccctcctcaa aggctgctgc 180 
aaaaggtatg cctttttctt ggaggctttc agcaaatact acctgggaac ctgcttcagc 240 
tcttggaata tttaagtgaa gagaacattt catagcattt gtatctttct ttgaaggagc 300 
caccagacag actgccttgg ccttggccag agcatctgct ttgtgcctga attttaacag 360 
catggtgatc ctgattcctg tgtgtcgaaa tctgctctcc ttcctgaggg gcacctgctc 420 
attttgcaac cacacgctga gaaagccatt ggatcacaac ctcaccttcc ataagctggt 480 
ggcatatatg atctgcatat tcacagctat tcatatcatt gcacatctat ttaactttga 540
acgctacagt agaagccaac aggccatgga tggatctctt gcctctgttc tctccagcct 600 
attccatccc gagaaagaag attcttggct aaatcccatc cagtctccaa acgtgacagt 660 
gatgtatgca gcatttacca gtattgctgg ccttactgga gtggtcgcca ctgtggcttt 720 
ggttctcatg gtaacttcag ctatggagtt tatccgcagg aattattttg agctcttctg 780 
gtatacacat caccttttca tcatctatat catctgctta gggatccatg gcctgggggg 840

gattgtccgg ggtcaaacag aagagagcat gagtgaaagt catccccgca actgttcata 900 

ctctttccac gagtgggata agtatgaaag gagttgcagg agtcctcatt ttgtggggca 960

accccctgag tcttggaagt ggatcctcgc gccgattgct ttttatatct ttgaaaggat 1020 

ccttcgcttt tatcgctccc ggcagaaggt cgtgattacc aaggttgtca tgcacccatg 1080 

taaagttttg gaattgcaga tgaggaagcg gggctttact atgggaatag gacagtatat 1140 

attcgtaaat tgcccctcga tttccttcct ggaatggcat ccctttactc tgacctctgc 1200 

tccagaggaa gaatttttct ccattcatat tcgagcagca ggggactgga cagaaaatct 1260 

cataaggaca tttgaacaac agcactcacc aatgcccagg atcgaggtgg atggtccctt 1320

tggcacagtc agtgaggatg tcttccagta cgaagtggct gtactggttg gggcagggat 1380 

tggcgtcact ccctttgctt ccttcttgaa atctatctgg tacaaattcc agcgtgcaca 1440 

caacaagctg aaaacacaaa agatctattt ctactggatt tgtagagaga cgggtgcctt 1500 

tgcctggttc aacaacttat tgaattccct ggaacaagag atggacgaat taggcaaacc 1560 

ggatttccta aactaccgac tcttcctcac tggctgggat agcaacattg ctggtcatgc 1620 

agcattaaac tttgacagag ccactgacgt cctgacaggt ctgaaacaga aaacctcctt 1680 

tgggagacca atgtgggaca atgagttttc tagaatagct actgcccacc ccaagtctgt 1740 

ggtgggggtt ttcttatgcg gccctccgac tttggcaaaa agcctgcgca aatgctgtcg 1800 

gcggtactca agtctggatc ctaggaaggt tcaattctac ttcaacaaag aaacgttctg 1860 

aattggagga agccgcacag tagtacttct ccatcttcct tttcactaac gtgtgggtca 1920 

gctactagat agtccgttgt cgcacaagga cttcactccc atcttaaagt tgactcaact 1980 

ccatcattct tgggctttgg caacatgaga gctgcataac tcacaattgc aaaacacatg 2040 

aattattatt ggggggattg taaatccttc tgggaaacct gcctttagct gaatcttgct 2100 

ggttgacact tgcacaattt aacctcaggt gtcttggttg atacctgata atcttccctc 2160 

ccacctgtcc ctcacagaag atttctaagt agggtgattt taaaatattt attgaatcca 2220 
cgacaaaaca ataatcataa ataataaaca taaaattacc aagattccca ctcccatatc 2280

atacccacta agaacatcgt tatacatgag cttatcatcc agtgtgacca acaatttata 2340

ctttactgtg ccaaaataat cttcatcttt gcttattgaa caattttgct gactttccct 2400 

agtaatatct taagtatatt aactggaatc aaatttgtat tatagttaga agccaactat 2460 

attgccagtt tgtattgttt gaaataactg gaaaggcctg acctacatcg tggggtaatt 2520 

taacagaagc tctttccatt ttttgttgtt gttgttaaag agttttgttt atgaatgtgt 2580 

tataaaaaga aaataaaaag ttataatttt gacggaaaa 2619

<210> 42 
<211> 499 
<212> PRT 
<213> Rat 

<400> 42 

Met Val Ile Leu Ile Pro Val Cys Arg Asn Leu Leu Ser Phe Leu Arg
1 5 10 15

Gly Thr Cys Ser Phe Cys Asn His Thr Leu Arg Lys Pro Leu Asp His
20 25 30

Asn Leu Thr Phe His Lys Leu Val Ala Tyr Met Ile Cys Ile Phe Thr
35 40 45

Ala Ile His Ile Ile Ala His Leu Phe Asn Phe Glu Arg Tyr Ser Arg
50 55 60

Ser Gln Gln Ala Met Asp Gly Ser Leu Ala Ser Val Leu Ser Ser Leu
65 70 75 80

Phe His Pro Glu Lys Glu Asp Ser Trp Leu Asn Pro Ile Gln Ser Pro
85 90 95

Asn Val Thr Val Met Tyr Ala Ala Phe Thr Ser Ile Ala Gly Leu Thr
100 105 110

Gly Val Val Ala Thr Val Ala Leu Val Leu Met Val Thr Ser Ala Met
115 120 125

Glu Phe Ile Arg Arg Asn Tyr Phe Glu Leu Phe Trp Tyr Thr His His
130 135 140
Leu Phe Ile Ile Tyr Ile Ile Cys Leu Gly Ile His Gly Leu Gly Gly
145 150 155 160

Ile Val Arg Gly Gln Thr Glu Glu Ser Met Ser Glu Ser His Pro Arg
165 170 175

Asn Cys Ser Tyr Ser Phe His Glu Trp Asp Lys Tyr Glu Arg Ser Cys
180 185 190

Arg Ser Pro His Phe Val Gly Gln Pro Pro Glu Ser Trp Lys Trp Ile
195 200 205

Leu Ala Pro Ile Ala Phe Tyr Ile Phe Glu Arg Ile Leu Arg Phe Tyr
210 215 220

Arg Ser Arg Gln Lys Val Val Ile Thr Lys Val Val Met His Pro Cys
225 230 235 240

Lys Val Leu Glu Leu Gln Met Arg Lys Arg Gly Phe Thr Met Gly Ile
245 250 255

Gly Gln Tyr Ile Phe Val Asn Cys Pro Ser Ile Ser Phe Leu Glu Trp
260 265 270

His Pro Phe Thr Leu Thr Ser Ala Pro Glu Glu Glu Phe Phe Ser Ile
275 280 285

His Ile Arg Ala Ala Gly Asp Trp Thr Glu Asn Leu Ile Arg Thr Phe
290 295 300

Glu Gln Gln His Ser Pro Met Pro Arg Ile Glu Val Asp Gly Pro Phe
305 310 315 320

Gly Thr Val Ser Glu Asp Val Phe Gln Tyr Glu Val Ala Val Leu Val
325 330 335

Gly Ala Gly Ile Gly Val Thr Pro Phe Ala Ser Phe Leu Lys Ser Ile
340 345 350

Trp Tyr Lys Phe Gln Arg Ala His Asn Lys Leu Lys Thr Gln Lys Ile
355 360 365

Tyr Phe Tyr Trp Ile Cys Arg Glu Thr Gly Ala Phe Ala Trp Phe Asn
370 375 380

Asn Leu Leu Asn Ser Leu Glu Gln Glu Met Asp Glu Leu Gly Lys Pro
385 390 395 400
Asp Phe Leu Asn Tyr Arg Leu Phe Leu Thr Gly Trp Asp Ser Asn Ile
405 410 415

Ala Gly His Ala Ala Leu Asn Phe Asp Arg Ala Thr Asp Val Leu Thr
420 425 430

Gly Leu Lys Gln Lys Thr Ser Phe Gly Arg Pro Met Trp Asp Asn Glu
435 440 445

Phe Ser Arg Ile Ala Thr Ala His Pro Lys Ser Val Val Gly Val Phe
450 455 460

Leu Cys Gly Pro Pro Thr Leu Ala Lys Ser Leu Arg Lys Cys Cys Arg
465 470 475 480

Arg Tyr Ser Ser Leu Asp Pro Arg Lys Val Gln Phe Tyr Phe Asn Lys
485 490 495

Glu Thr Phe



<210> 43 
<211> 35 
<212> DNA

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic 
primer 

<400> 43 

ttctgagtag gtgtgcattt gagtgtcata aagac 

<210> 44 
<211> 45 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic 
primer 

<400> 44 

ttttccgtca aaattataac tttttatttt ctttttataa cacat 
<210> 45

<211> 5508 

<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 

<222> (155)..(4810)
<400> 45 

1 GCAGAGCTGC AGAGGCACCG GACGAGAGAG GGCTCCGCGG GCCCAGCTGG CAGCCAGGCC 
61 GGAGACAAGT TGCAGTCCCG GGCTCTGGTG ACGCCGTGGC CGCAGGGTCT CCATTTTGGG 
121 ACATTCTAAT CCCTGAGCCC CTATTATTTT CATCATGGGC TTCTGCCTGG CTCTAGCATG 
181 GACACTTCTG GTTGGGGCAT GGACCCCTCT GGGAGCTCAG AACCCCATTT CGTGGGAGGT 
241 GCAGCGATTT GATGGGTGGT ACAACAACCT CATGGAGCAC AGATGGGGCA GCAAAGGCTC 
301 CCGGCTGCAG CGCCTGGTCC CAGCCAGCTA TGCAGATGGC GTGTACCAGC CCTTGGGAGA 
361 ACCCCACCTG CCCAACCCCC GAGACCTTAG CAACACCATC TCAAGGGGCC CTGCAGGGCT 
421 GGCCTCCCTG AGAAACCGCA CAGTGTTGGG GGTCTTCTTT GGCTATCACG TGCTTTCAGA 
481 CCTGGTGAGC GTGGAAACTC CCGGCTGCCC CGCCGAGTTC CTCAACATTC GCATCCCGCC 
541 CGGAGACCCC ATGTTCGACC CCGACCAGCG CGGGGACGTG GTGCTGCCCT TCCAGAGAAG 
601 CCGCTGGGAC CCCGAGACCG GACGGAGTCC CAGCAATCCC CGGGACCCGG CCAACCAGGT 
661 GACGGGCTGG CTGGACGGCA GCGCCATCTA TGGTTCCTCG CATTCCTGGA GCGACGCGCT 
721 GCGGAGCTTC TCCAGGGGAC AGCTGGCGTC GGGGCCCGAC CCCGCTTTTC CCCGAGACTC 
781 GCAGAACCCC CTGCTCATGT GGGCGGCGCC CGACCCCGCC ACCGGGCAGA ACGGGCCCCG
841 GGGGCTGTAC GCCTTCGGGG CAGAGAGAGG GAACCGGGAA CCCTTCCTGC AGGCGCTGGG 
901 CCTGCTCTGG TTCCGCTACC ACAACCTGTG GGCGCAGAGG CTGGCCCGCC AGCACCCAGA 
961 CTGGGAGGAC GAGGAGCTGT TCCAGCACGC ACGCAAGAGG GTCATCGCCA CCTACCAGAA 
1021 CATCGCTGTG TATGAGTGGC TGCCCAGCTT CCTGCAGAAA ACACTCCCGG AGTATACAGG 
1081 ATACCGGCCA TTTCTGGACC CCAGCATCTC CTCAGAGTTC GTGGCGGCCT CTGAGCAGTT
1141 CCTGTCCACC ATGGTGCCCC CTGGCGTCTA CATGAGAAAT GCCAGCTGCC ACTTCCAGGG 
1201 GGTCATCAAT CGGAACTCAA GTGTCTCCAG AGCTCTCCGG GTCTGCAACA GCTACTGGAG 
1261 CCGTGAGCAC CCAAGCCTAC AAAGTGCTGA AGATGTGGAT GCACTGCTGC TGGGCATGGC 
1321 CTCCCAGATC GCAGAGCGAG AGGACCATGT GTTGGTTGAA GATGTGCGGG ATTTCTGGCC 
1381 TGGGCCACTG AAGTTTTCCC GCACAGACCA CCTGGCCAGC TGCCTGCAGC GGGGCCGGGA 
1441 TCTGGGCCTG CCCTCTTACA CCAAGGCCAG GGCAGCACTG GGCTTGTCTC CCATTACCCG 
1501 CTGGGAGGAC ATCAACCCTG CACTCTCCCG GAGCAATGAC ACTGTACTGG AGGCCACAGC 
1561 TGCCCTGTAC AACCAGGACT TATCCTGGCT AGAGCTGCTC CCTGGGGGAC TCCTGGAGAG 
1621 CCACCGGGAC CCTGGACCTC TGTTCAGCAC CATCGTCCTT GAACAATTTG TGCGGCTACG 
1681 GGATGGTGAC CGCTACTGGT TTGAGAACAC CAGGAATGGG CTGTTCTCCA AGAAGGAGAT
1741 TGAAGAAATC CGAAATACCA CCCTGCAGGA CGTGCTGGTC GCTGTTATCA ACATTGACCC
1801 CAGTGCTCTG CAGCCCAATG TCTTTGTCTG GCATAAAGGA GACCCCTGTC CGCAGCCGAG
1861 ACAGCTCAGC ACTGAAGGCC TGCCAGCGTG TGCTCCCTCT GTTGTTCGTG ACTATTTTGA 
1921 GGGCAGTGGA TTTGGCTTCG GGGTCACCAT CGGGACCCTC TGTTGCTTCC CTTTGGTGAG 
1981 CCTGCTCAGT GCCTGGATTG TTGCCCGGCT CCGGATGAGA AATTTCAAGA GGCTCCAGGG
2041 CCAGGACCGC CAGAGCATCG TGTCTGAGAA GCTCGTGGGA GGCATGGAAG CTTTGGAATG
2101 GCAAGGCCAC AAGGAGCCCT GCCGGCCCGT GCTTGTGTAC CTGCAGCCCG GGCAGATCCG
2161 TGTGGTAGAT GGCAGGCTCA CCGTCCTCCG CACCATCCAG CTGCAGCCTC CACAGAAGGT 
2221 CAACTTCGTC CTGTCCAGCA ACCGTCGACG CCGCACTCTG CTGCTCAAGA TCCCCAAGGA 
2281 GTATGACCTG GTGCTGCTGT TTAACTTGGA GGAAGAGCGG CAGGCGCTGG TGGAAAATCT
2341 CCGGGGAGCT CTGAAGGAGA GCGGGTTGAG CATCCAGGAG TGGGAGCTGC GGGAGCAGGA
2401 GCTGATGAGA GCAGCTGTGA CACGGGAGCA GCGGAGGCAC CTCCTGGAGA CCTTTTTCAG
2461 GCACCTTTTC TCCCAGGTGC TGGACATCAA CCAGGCCGAC GCAGGGACCC TGCCCCTGGA 
2521 CTCCTCCCAG AAGGTGCGGG AGGCCCTGAC CTGTGAGCTG AGCAGGGCCG AGTTTCCCGA
2581 GTCCCTGGGC CTCAAGCCCC AGGACATGTT TGTGGAGTCC ATGTTCTCTC TGGCTGACAA 
2641 GGATGGCAAT GGCTACCTGT CCTTCCGAGA GTTCCTGGAC ATCCTGGTGG TCTTCATGAA
2701 AGGCTCTCCT GAGGAAAAGT CTCGCCTTAT GTTCCGCATG TACGACTTTG ATGGGAATGG 
2761 CCTCATTTCC AAGGATGAGT TCATCAGGAT GCTGAGATCC TTCATCGAGA TCTCCAACAA 
2821 CTGCCTGTCC AAGGCCCAGC TGGCTGAGGT GGTGGAGTCC ATGTTCCGGG AGTCGGGATT
2881 CCAGGACAAG GAGGAACTGA CATGGGAAGA TTTTCACTTC ATGCTGCGGG ACCACAATAG 
2941 CGAGCTCCGC TTCACGCAGC TCTGTGTCAA AGGGGTGGAG GTGCCTGAAG TCATCAAGGA 
3001 CCTCTGCCGG CGAGCCTCCT ACATCAGCCA GGATATGATC TGTCCCTCTC CCAGAGTGAG 
3061 TGCCCGCTGT TCCCGCAGCG ACATTGAGAC TGAGTTGACA CCTCAGAGAC TGCAGTGCCC 
3121 CATGGACACA GACCCTCCCC AGGAGATTCG GCGGAGGTTT GGCAAGAAGG TAACGTCATT 
3181 CCAGCCCTTG CTGTTCACTG AGGCGCACCG AGAGAAGTTC CAACGCAGCT GTCTCCACCA 
3241 GACGGTGCAA CAGTTCAAGC GCTTCATTGA GAACTACCGG CGCCACATCG GCTGCGTGGC 
3301 CGTGTTCTAC GCCATCGCTG GGGGGCTTTT CCTGGAGAGG GCCTACTACT ACGCCTTTGC 
3361 CGCACATCAC ACGGGCATCA CGGACACCAC CCGCGTGGGA ATCATCCTGT CGCGGGGCAC 
3421 AGCAGCCAGC ATCTCTTTCA TGTTCTCCTA CATCTTGCTC ACCATGTGCC GCAACCTCAT 
3481 CACCTTCCTG CGAGAAACCT TCCTCAACCG CTACGTGCCC TTCGACGCCG CCGTGGACTT 
3541 CCATCGCCTC ATTGCCTCCA CCGCCATCGT CCTCACAGTC TTACACAGTG TGGGCCATGT 
3601 GGTGAATGTG TACCTGTTCT CCATCAGCCC CCTCAGCGTC CTCTCTTCCC TCTTTCCTGG 
3661 CCTCTTCCAT GATGATGGGT CTGAGTTCCC CCAGAAGTAT TACTGGTCGT TCTTCCAGAC 
3721 CGTACCAGGC CTCACGGGGG TTGTGCTGCT CCTGATCCTG GCCATCATGT ATGTCTTTGC 
3781 CTCCCACCAC TTCCGCCGCC GCAGTTTCCG GGGCTTCTGG CTGACCCACC ACCTCTACAT 
3841 CCTGCTCTAT GTCCTGCTCA TCATCCATGG TAGCTTTGCC CTGATCCAGC TGCCCCGTTT 
3901 CCACATCTTC TTCCTGGTCC CAGCAATCAT CTATGGGGGC GACAAGCTGG TGAGCCTGAG 
3961 CCGGAAGAAG GTGGAGATCA GCGTGGTGAA GGCGGAGCTG CTGCCCTCAG GAGTGACCCA 
4021 CCOXSCGGTTC CAGCGGCCCC AGGGCTTTGA GTACAAGTCA GGGCAGTGGG TGCGGATCGC 
4081 TTGCCTGGCT CTGGGGACCA CCGAGTACCA CCCCTTCACA CTGACCTCTG CGCCCCATGA 
4141 GGACACGCTT AGCCTGCACA TCCGGGCAGC AGGGCCCTGG ACCACTCGCC TCAGGGAGAT 
4201 CTACTCAGCC CCGACGGGTG ACAGATGTGC CAGATACCCA AAGCTGTACC TTGATGGACC 
4261 ATTTGGAGAG GGCCACCAGG AGTGGCATAA GTTTGAGGTG TCAGTGTTAG TGGGAGGGGG 
4321 CATTGGGGTC ACCCCTTTTG CCTCCATCCT CAAAGACCTG GTCTTCAAGT CATCCGTCAG 
4381 CTGCCAAGTG TTCTGTAAGA AGATCTACTT CATCTGGGTG ACGCGGACCC AGCGTCAGTT 
4441 TGAGTGGCTG GCTGACATCA TCCGAGAGGT GGAGGAGAAT GACCACCAGG ACCTGGTGTC 
4501 TGTGCACATC TACATCACCC AGCTGGCTGA GAAGTTCGAC CTCAGGACCA CTATGCTGTA 
4561 CATCTGTGAG CGGCACTTCC AGAAGGTTCT GAACCGGAGT CTATTCACAG GCCTGCGCTC 
4621 CATCACCCAC TTTGGCCGTC CCCCCTTTGA GCCCTTCTTC AACTCCCTGC AGGAGGTCCA 
4681 CCCCCAGGTC CGGAAGATCG GGGTGTTTAG CTGTGGCCCC CCTGGCATGA CCAAGAATGT 
4741 GGAAAAGGCC TGTCAGCTCA TCAACAGGCA GGACCGGACT CACTTCTCCC ACCATTATGA 
4801 GAACTTCTAG GCCCCTGCCC GGGGGTTCTG CCCACTGTCC AGTTGAGCAG AGGTTTGAGC 
4861 CCACACCTCA CCTCTGTTCT TCCTATTTCT GGCTGCCTCA GCCTTCTCTG ATTTCCCACC 
4921 TCCCAACCTT GTTCCAGGTG GCCATAGTCA GTCACCATGT GTGGGCTCAG GGACCCCCAG 
4981 GACCAGGATG TGTCTCAGCC TGGAGAAATG GTGGGGGGGC AGTGTCTAGG GACTAGAGTG 
5041 AGAAGTAGGG GAGCTACTGA TTTGGGGCAA AGTGAAACCT CTGCTTCAGA CTTCAGAAAC 
5101 AAATCTCAGA AGACAAGCTG ACCTGACAAG TACTATGTGT GTGCATGTCT GTATGTGTGT 
5161 TGGGGCGGTG AGTGTAAGGA TGCAGTGGGA GCATGGATGC TGGCATCTTA GAACCCTCCC 
5221 TACTCCCATA CCTCCTCCTC TTCTGGGCTC CCCACTGTCA GACGGGCTGG CAAATGCCTT 
5281 GCAGGAGGTA GAGGCTGGAC CCATGGCAAG CCATTTACAG AAACCCACTC GGCACCCCAG 
5341 TCTAACACCA CAACTAATTT CACCCAAGGT TTTAAGCACG TTCTTTCATC AGACCCTGGC 
5401 CCAATACCTA TGTATGCAAT GCTCCTCAGC CCTCTTCTCC CTGCTCCAGT AGTCTCCCTT 
5461 CCAAATAAAT CACTTTTCTG CCAAAAAAAA AAAA 
<210> 46 
<211> 1551 
<212> PRT 

<213> Homo sapiens 
<400> 46 
1 MGFCLALAWT LLVGAWTPLG AQNPISWEVQ RFDGWYNNLM EHRWGSKGSR LQRLVPASYA 
61 DGVYQPtiGEP HLPNPRDLSN TISRGPAGLA SLRNRTVLGV FFGYHVLSDL VSVETPGCPA 
121 EFLNIRIPPG DPMFDPDQRG DWLPFQRSR WDPETGRSPS NPRDPANQVT GWLDGSAIYG 
181 SSHSWSDALR SFSRGQLASG PDPAFPRDSQ NPLLMWAAPO PATGQNGPRG LYAFGAERC^ 
241 REPFLQALGL LWFRYHNLWA QRXiARQHPDW EDEELFQHAR KRVIATYQMI AVYEWLPSFL 
301 OKTLPEYTGY RPFLDPSISS EFVAASEQFL STMVPPGVYM RNASCHFQGV INRNSSVSRA 
361 LRVCNSYWSR EHPSLQSAED VDALLLGMAS QIAEREDHVL VEDVRDFWPG PLKFSRTDHL 
421 ASCLQRGRDL GLPSYTKARA ALGLSPITRW QDINPALSRS NDTVLEATAA LYNQDLSWLE 
481 LLPGGLLESH RDPGPLFSTI VLEQFVRLRD GDRYWFENTR NGLFSKKEIE EIRNTTLQDV 
541 LVAVINIDPS ALQPNVFVWH KGDPCPQPRQ LSTEGLPACA PSVVRDYFEG SGFGFGVTIG 
601 TLCCFPLVSL LSAWIVARLR MRNFKRLQGQ DRQSIVSEKL VGGMEALEWQ GHKEPCRPVL 
661 VYLQPGQIRV VDGRLTVLRT IQLQPPQKVN FVLSSNRGRR TLLLKIPKEY DLVLLFNLEE 
721 ERQALVENLR GALKESGLSI QEWELREQEL MRAAVTREQR RHLLETFFRH LFSQVLDINQ 
781 ADAGTLPLDS SQKVREALTC ELSRAEFAES LGLKPQDMFV ESMFSLADKD GNGYLSFREF 
841 LDILVVFMKG SPEEKSRLMF RMYDFDGNGL ISKDEFIRML RSFIEISNNC LSKAQLAEVV 
901 ESMFRESGFQ DKEELTWEDF HFMLRDHNSE LRFTQLCVKG VEVPEVIKDL CRRASYISQD 
961 MICPSPRVSA RCSRSDIETE LTPQRLQCPM DTDPPQEIRR RFGKKVTSFQ PLLFTEAHRE 
1021 KFQRSCLHQT VQQFKRFIEN YRRHIGCVAV FYAIAGGLFL ERAYYYAFAA HHTGITDTTR 
1081 VGIILSRGTA ASISFMFSYI LLTMCRNLIT FLRETFLNRY VPFDAAVDFH RLIASTAIVL 
1141 TVLHSVGHVV NVYLFSISPL SVLSCLFPGL FHDDGSEFPQ KYYWWFFQTV PGLTGVVLLL 
1201 ILAIMYVFAS HHFRRRSFRG FWLTHHLYIL LYVLLIIHGS FALIQLPRFH IFFLVPAIIY 
1261 GGDKLVSLSR KKVEISVVKA ELLPSGVTHL RFQRPQGFEY KSGQWVRIAC LALGTTEYHP 
1321 FTLTSAPHED TLSLHIRAAG PWTTRLREIY SAPTGDRCAR YPKLYLDGPF GEGHQEWHKF 
1381 EVSVLVGGGI GVTPFASILK DLVFKSSVSC QVFCKKIYFI WVTRTQRQFE WLADIIREVE 
1441 ENDHQDLVSV HIYITQLAEK FDLRTTMLYI CERHFQKVLN RSLFTGLRSI THFGRPPFEP 
1501 FFNSLQEVHP QVRKIGVFSC GPPGMTKNVE KACQLINRQD RTHFSHHYEN F 
<210> 47 
<211> 3453 
<212> DNA 

<213> Homo sapiens 

<220> 
<221> CDS 

<222> (438)..(3134) 
<400> 47 

gtcctcgacc agtttgtacg gctgcgggat ggtgaccgct actggtttga gaacaccagg 60 

aatgggctgt tctccaagaa ggagattgag acatccgaaa taccaccgtg cgggacgtgc 120 

tggtcgctgt tatcaacatt gaccccagtg ccctgcagcc caatgtcttt gtctggcata 180 

aaggtgcacc ctgccctcaa cctaagcagc tcacaactga cggcctgccc cagtgtgcac 240 

ccctgactgt gcttgacttc tttgaaggca gcagccctgg ttttgccatc accatcattg 300 

ctctctgctg ccttccctta gtgagtctgc ttctctctgg agtggtggcc tatttccggg 360 

gccgagaaca caagaagcta caaaagaaac tcaaagagag cgtgaagaag gaagcagcca 420 

aagatggagt gccagcg at.g gag tgg cca ggc ccc aag gag agg age agt 470 
Met Glu Trp Pro Gly Pro Lys Glu Arg Ser Ser 
1 5 10. 

ccc ate ate ate eag etg etg tea gac agg tgt ctg cag gte etg aac 518 
Pro lie lie lie Gin Leu Leu Ser Asp Arg Cys Leu Gin Val Leu Asn 
15 20 . 25 

agg cat etc act gtg etc cgt gtg gtc cag ctg cag cct ctg cag eag 566 
Arg His Leu Thr Val Leu Arg Val Val Gin Leu Gin Pro Leu Gin Gin 
30 35 40 

gtc aac etc ate ctg tec aac aac ega gga tgc cgc ace etg ctg etc 614 
Val Asn Leu lie Leu Ser Asn Asn Arg Gly Cys Arg Thr Leu Leu Leu 
45 50 55 

aag ate cct aag gag tat gac ctg gtg ctg ctg ttt agt tct gaa gag 662 
Lys lie Pro Lys Glu Tyr Asp Leu Val Leu Leu Phe Ser Ser Glu Glu 
60 65 70 75 

gaa egg ggc gcc ttt gtg cag eag eta tgg gac ttc tgc gtg cgc tgg 710 
Glu Arg Gly Ala Phe Val Gin Gin Leu Trp Asp Phe Cys Val Arg Trp 
80 85 90 

gct ctg ggc ctc cat gtg gct gag atg agc gag aag gag cta ttt agg 758 
Ala Leu Gly Leu His Val Ala Glu Met Ser Glu Lys Glu Leu Phe Arg 
95 100 105 

aag gct gtg aca aag cag cag cgg gaa cgc atc ctg gag atc ttc ttc 806 
Lys Ala Val Thr Lys Gln Gln Arg Glu Arg Ile Leu Glu Ile Phe Phe 
110 115 120 

aga cac ctt ttt gct cag gtg ctg gac atc aac cag gcc gac gca ggg 854 
Arg His Leu Phe Ala Gln Val Leu Asp Ile Asn Gln Ala Asp Ala Gly 
125 130 135 

acc ctg ccc ctg gac tcc tcc cag aag gtg cgg gag gcc ctg acc tgc 902 
Thr Leu Pro Leu Asp Ser Ser Gln Lys Val Arg Glu Ala Leu Thr Cys 
140 145 150 155 

gag ctg agc agg gcc gag ttt gcc gag tcc ctg ggc ctc aag ccc cag 950 
Glu Leu Ser Arg Ala Glu Phe Ala Glu Ser Leu Gly Leu Lys Pro Gln 
160 165 170 

gac atg ttt gtg gag tcc atg ttc tct ctg gct gac aag gat ggc aat 998 
Asp Met Phe Val Glu Ser Met Phe Ser Leu Ala Asp Lys Asp Gly Asn 
175 180 185 

ggc tac ctg tcc ttc cga gag ttc ctg gac atc ctg gtg gtc ttc atg 1046 
Gly Tyr Leu Ser Phe Arg Glu Phe Leu Asp Ile Leu Val Val Phe Met 
190 195 200 

aaa ggc tcc cca gag gat aag tcc cgt cta atg ttt acc atg tat gac 1094 
Lys Gly Ser Pro Glu Asp Lys Ser Arg Leu Met Phe Thr Met Tyr Asp 
205 210 215 

ctg gat gag aat ggc ttc ctc tcc aag gac gaa ttc ttc acc atg atg 1142 
Leu Asp Glu Asn Gly Phe Leu Ser Lys Asp Glu Phe Phe Thr Met Met 
220 225 230 235 

cga tcc ttc atc gag atc tcc aac aac tgc ctg tcc aag gcc cag ctg 1190 
Arg Ser Phe Ile Glu Ile Ser Asn Asn Cys Leu Ser Lys Ala Gln Leu 
240 245 250 

gcc gag gtg gtg gag tct atg ttc cgg gag tcg gga ttc cag gac aag 1238 
Ala Glu Val Val Glu Ser Met Phe Arg Glu Ser Gly Phe Gln Asp Lys 
255 260 265 

gag gag ctg aca tgg gag gat ttt cac ttc atg ctg cgg gac cat gac 1286 
Glu Glu Leu Thr Trp Glu Asp Phe His Phe Met Leu Arg Asp His Asp 
270 275 280 

agc gag ctc cgc ttc acg cag ctc tgt gtc aaa ggt gga ggt gga ggt 1334 
Ser Glu Leu Arg Phe Thr Gln Leu Cys Val Lys Gly Gly Gly Gly Gly 
285 290 295 

gga aat ggt att aga gat atc ttt aaa caa aac atc agc tgt cga gtc 1382 
Gly Asn Gly Ile Arg Asp Ile Phe Lys Gln Asn Ile Ser Cys Arg Val 
300 305 310 315 

tcg ttc atc act cgg aca cct ggg gag cgc tcc cac ccc cag gga ctg 1430 
Ser Phe Ile Thr Arg Thr Pro Gly Glu Arg Ser His Pro Gln Gly Leu 
320 325 330 

ggg ccc cct gtc cca gaa gcc cca gag ctg gga ggc cct gga ctg aag 1478 
Gly Pro Pro Val Pro Glu Ala Pro Glu Leu Gly Gly Pro Gly Leu Lys 
335 340 345 

aag agg ttt ggc aaa aag gca gca gtg ccc act ccc cgg ctg tac aca 1526 
Lys Arg Phe Gly Lys Lys Ala Ala Val Pro Thr Pro Arg Leu Tyr Thr 
350 355 360 

gag gcg ctg caa gag aag atg cag cga ggc ttc cta gcc caa aag ctg 1574 
Glu Ala Leu Gln Glu Lys Met Gln Arg Gly Phe Leu Ala Gln Lys Leu 
365 370 375 

cag cag tac aag cgc ttc gtg gag aac tac cgg agg cac atc gtg tgt 1622 
Gln Gln Tyr Lys Arg Phe Val Glu Asn Tyr Arg Arg His Ile Val Cys 
380 385 390 395 

gtg gca atc ttc tcg gcc atc tgt gtt ggc gtg ttt gca gat cgt gct 1670 
Val Ala Ile Phe Ser Ala Ile Cys Val Gly Val Phe Ala Asp Arg Ala 
400 405 410 

tac tac tat ggc ttt gcc ttg cca ccc tcg gac att gca cag acc acc 1718 
Tyr Tyr Tyr Gly Phe Ala Leu Pro Pro Ser Asp Ile Ala Gln Thr Thr 
415 420 425 

ctc gtg ggc atc atc ctg tca cga ggc acg gcg gcc agc gtc tcc ttc 1766 
Leu Val Gly Ile Ile Leu Ser Arg Gly Thr Ala Ala Ser Val Ser Phe 
430 435 440 

atg ttc tct tat atc ttg ctc acc atg tgc cgc aac ctc ata acc ttc 1814 
Met Phe Ser Tyr Ile Leu Leu Thr Met Cys Arg Asn Leu Ile Thr Phe 
445 450 455 

ctg cga gag act ttc ctc aac cgc tat gtg cct ttt gat gcc gca gtg 1862 
Leu Arg Glu Thr Phe Leu Asn Arg Tyr Val Pro Phe Asp Ala Ala Val 
460 465 470 475 

gac ttc cac cgc tgg atc gcc atg gct gct gtt gtc ctg gcc att ttg 1910 
Asp Phe His Arg Trp Ile Ala Met Ala Ala Val Val Leu Ala Ile Leu 
480 485 490 

cac agt gct ggc cac gca gtc aat gtc tac atc ttc tca gtc agc cca 1958 
His Ser Ala Gly His Ala Val Asn Val Tyr Ile Phe Ser Val Ser Pro 
495 500 505 

ctc agc ctg ctg gcc tgc ata ttc ccc aac gtc ttt gtg aat gat ggg 2006 
Leu Ser Leu Leu Ala Cys Ile Phe Pro Asn Val Phe Val Asn Asp Gly 
510 515 520 

tcc aag ctt ccc cag aag ttc tat tgg tgg ttc ttc cag acc gtc cca 2054 
Ser Lys Leu Pro Gln Lys Phe Tyr Trp Trp Phe Phe Gln Thr Val Pro 
525 530 535 



ggt atg aca ggt gtg ctt ctg ctc ctg gtc ctg gcc atc atg tat gtc 2102 
Gly Met Thr Gly Val Leu Leu Leu Leu Val Leu Ala Ile Met Tyr Val 
540 545 550 555 

ttc gcc tcc cac cac ttc cgc cgc cgc agc ttc cgg ggc ttc tgg ctg 2150 
Phe Ala Ser His His Phe Arg Arg Arg Ser Phe Arg Gly Phe Trp Leu 
560 565 570 

acc cac cac ctc tac atc ctg ctc tat gcc ctg ctc atc atc cat ggc 2198 
Thr His His Leu Tyr Ile Leu Leu Tyr Ala Leu Leu Ile Ile His Gly 
575 580 585 

agc tat gct ctg atc cag ctg ccc act ttc cac atc tac ttc ctg gtc 2246 
Ser Tyr Ala Leu Ile Gln Leu Pro Thr Phe His Ile Tyr Phe Leu Val 
590 595 600 

ccg gca atc atc tat gga ggt gac aag ctg gtg agc ctg agc cgg aag 2294 
Pro Ala Ile Ile Tyr Gly Gly Asp Lys Leu Val Ser Leu Ser Arg Lys 
605 610 615 

aag gtg gag atc agc gtg gtg aag gcg gag ctg ctg ccc tca gga gtg 2342 
Lys Val Glu Ile Ser Val Val Lys Ala Glu Leu Leu Pro Ser Gly Val 
620 625 630 635 



acc tac ctg caa ttc cag agg ccc caa ggc ttt gag tac aag tca gga 2390 
Thr Tyr Leu Gln Phe Gln Arg Pro Gln Gly Phe Glu Tyr Lys Ser Gly 
640 645 650 

cag tgg gtg cgg atc gcc tgc ctg gct ctg ggg acc acc gag tac cac 2438 
Gln Trp Val Arg Ile Ala Cys Leu Ala Leu Gly Thr Thr Glu Tyr His 
655 660 665 

ccc ttc aca ctg acc tcc gcg ccc cat gag gac aca ctc agc ctg cac 2486 
Pro Phe Thr Leu Thr Ser Ala Pro His Glu Asp Thr Leu Ser Leu His 
670 675 680 

atc cgg gca gtg ggg ccc tgg acc act cgc ctc agg gag atc tac tca 2534 
Ile Arg Ala Val Gly Pro Trp Thr Thr Arg Leu Arg Glu Ile Tyr Ser 
685 690 695 

tcc cca aag ggc aat ggc tgt gct gga tac cca aag ctg tac ctt gat 2582 
Ser Pro Lys Gly Asn Gly Cys Ala Gly Tyr Pro Lys Leu Tyr Leu Asp 
700 705 710 715 

gga ccg ttt gga gag ggc cat cag gag tgg cat aaa ttt gag gtg tca 2630 
Gly Pro Phe Gly Glu Gly His Gln Glu Trp His Lys Phe Glu Val Ser 
720 725 730 

gtg ttg gtg gga ggg ggc att ggg gtc acc ccc ttt gcc tcc atc ctc 2678 
Val Leu Val Gly Gly Gly Ile Gly Val Thr Pro Phe Ala Ser Ile Leu 
735 740 745 

aaa gac ctg gtc ttc aag tca tcc ttg ggc agc caa atg ctg tgt aag 2726 
Lys Asp Leu Val Phe Lys Ser Ser Leu Gly Ser Gln Met Leu Cys Lys 
750 755 760 

aag atc tac ttc atc tgg gtg aca cgg acc cag cgt cag ttt gag tgg 2774 
Lys Ile Tyr Phe Ile Trp Val Thr Arg Thr Gln Arg Gln Phe Glu Trp 
765 770 775 

ctg gct gac atc atc caa gag gtg gag gag aac gac cac cag gac ctg 2822 
Leu Ala Asp Ile Ile Gln Glu Val Glu Glu Asn Asp His Gln Asp Leu 
780 785 790 795 

gtg tct gtg cac att tat gtc acc cag ctg gct gag aag ttc gac ctc 2870 
Val Ser Val His Ile Tyr Val Thr Gln Leu Ala Glu Lys Phe Asp Leu 
800 805 810 

agg acc acc atg cta tac atc tgc gag cgg cac ttc cag aaa gtg ctg 2918 
Arg Thr Thr Met Leu Tyr Ile Cys Glu Arg His Phe Gln Lys Val Leu 
815 820 825 

aac cgg agt ctg ttc acg ggc ctg cgc tcc atc acc cac ttt ggc cgt 2966 
Asn Arg Ser Leu Phe Thr Gly Leu Arg Ser Ile Thr His Phe Gly Arg 
830 835 840 

ccc ccc ttc gag ccc ttc ttc aac tcc ctg cag gag gtc cac cca cag 3014 
Pro Pro Phe Glu Pro Phe Phe Asn Ser Leu Gln Glu Val His Pro Gln 
845 850 855 



gtg cgc aag atc ggg gtg ttc agc tgc ggc cct cca gga atg acc aag 3062 
Val Arg Lys Ile Gly Val Phe Ser Cys Gly Pro Pro Gly Met Thr Lys 
860 865 870 875 

aat gta gag aag gcc tgt cag ctc gtc aac agg cag gac cga gcc cac 3110 
Asn Val Glu Lys Ala Cys Gln Leu Val Asn Arg Gln Asp Arg Ala His 
880 885 890 

ttc atg cac cac tat gag aac ttc tgagcctgtc ctccctggct gctgcttcca 3164 
Phe Met His His Tyr Glu Asn Phe 
895 

gtatcctgcc ttctcttctg tgcacctaag ttgcccagcc ctgctggcaa tctctccatc 3224 

agaatccacc ttaggcctca gctggagggc tgcagagccc ctcccaatat tgggagaata 3284 

ttgacccaga caattataca aatgagaaaa ggcattaaaa tttacgtttc tgatgatggc 3344 

aaagctcatt tttctattag taactctgct gaagatccat ttattgcaat tcatgctgaa 3404 

tctaaattgt aaaatttaaa attaaatgca tgtcctcaaa aaaaaaaaa 3453 
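
As a quick consistency check on the CDS feature of SEQ ID NO: 47, the span (438)..(3134) should cover a whole number of codons equal to the 899 residues declared for SEQ ID NO: 48. The following is a minimal Python sketch of that arithmetic; the helper name `cds_codon_count` is an illustrative assumption and is not part of the listing.

```python
def cds_codon_count(start: int, end: int) -> int:
    """Return the number of complete codons in a 1-based, inclusive CDS span.

    Hypothetical helper for illustration; raises if the span is not a
    multiple of three nucleotides long.
    """
    length = end - start + 1
    if length % 3 != 0:
        raise ValueError(f"CDS length {length} is not a multiple of 3")
    return length // 3


# Values taken from the <222> and <211> fields of SEQ ID NO: 47 and 48.
codons = cds_codon_count(438, 3134)
declared_protein_length = 899
print(codons, codons == declared_protein_length)
```

The counts agree because the annotated span ends at the last sense codon (ttc, Phe 899); the tga stop codon that follows at positions 3135 to 3137 lies outside the feature.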



<210> 48 
<211> 899 
<212> PRT 

<213> Homo sapiens 
<400> 48 

Met Glu Trp Pro Gly Pro Lys Glu Arg Ser Ser Pro Ile Ile Ile Gln 
1 5 10 15 

Leu Leu Ser Asp Arg Cys Leu Gln Val Leu Asn Arg His Leu Thr Val 
20 25 30 

Leu Arg Val Val Gln Leu Gln Pro Leu Gln Gln Val Asn Leu Ile Leu 
35 40 45 

Ser Asn Asn Arg Gly Cys Arg Thr Leu Leu Leu Lys Ile Pro Lys Glu 
50 55 60 

Tyr Asp Leu Val Leu Leu Phe Ser Ser Glu Glu Glu Arg Gly Ala Phe 
65 70 75 80 

Val Gln Gln Leu Trp Asp Phe Cys Val Arg Trp Ala Leu Gly Leu His 
85 90 95 

Val Ala Glu Met Ser Glu Lys Glu Leu Phe Arg Lys Ala Val Thr Lys 
100 105 110 

Gln Gln Arg Glu Arg Ile Leu Glu Ile Phe Phe Arg His Leu Phe Ala 
115 120 125 

Gln Val Leu Asp Ile Asn Gln Ala Asp Ala Gly Thr Leu Pro Leu Asp 
130 135 140 

Ser Ser Gln Lys Val Arg Glu Ala Leu Thr Cys Glu Leu Ser Arg Ala 
145 150 155 160 

Glu Phe Ala Glu Ser Leu Gly Leu Lys Pro Gln Asp Met Phe Val Glu 
165 170 175 

Ser Met Phe Ser Leu Ala Asp Lys Asp Gly Asn Gly Tyr Leu Ser Phe 
180 185 190 

Arg Glu Phe Leu Asp Ile Leu Val Val Phe Met Lys Gly Ser Pro Glu 
195 200 205 

Asp Lys Ser Arg Leu Met Phe Thr Met Tyr Asp Leu Asp Glu Asn Gly 
210 215 220 

Phe Leu Ser Lys Asp Glu Phe Phe Thr Met Met Arg Ser Phe Ile Glu 
225 230 235 240 

Ile Ser Asn Asn Cys Leu Ser Lys Ala Gln Leu Ala Glu Val Val Glu 
245 250 255 

Ser Met Phe Arg Glu Ser Gly Phe Gln Asp Lys Glu Glu Leu Thr Trp 
260 265 270 

Glu Asp Phe His Phe Met Leu Arg Asp His Asp Ser Glu Leu Arg Phe 
275 280 285 

Thr Gln Leu Cys Val Lys Gly Gly Gly Gly Gly Gly Asn Gly Ile Arg 
290 295 300 

Asp Ile Phe Lys Gln Asn Ile Ser Cys Arg Val Ser Phe Ile Thr Arg 
305 310 315 320 

Thr Pro Gly Glu Arg Ser His Pro Gln Gly Leu Gly Pro Pro Val Pro 
325 330 335 

Glu Ala Pro Glu Leu Gly Gly Pro Gly Leu Lys Lys Arg Phe Gly Lys 
340 345 350 

Lys Ala Ala Val Pro Thr Pro Arg Leu Tyr Thr Glu Ala Leu Gln Glu 
355 360 365 

Lys Met Gln Arg Gly Phe Leu Ala Gln Lys Leu Gln Gln Tyr Lys Arg 
370 375 380 

Phe Val Glu Asn Tyr Arg Arg His Ile Val Cys Val Ala Ile Phe Ser 
385 390 395 400 

Ala Ile Cys Val Gly Val Phe Ala Asp Arg Ala Tyr Tyr Tyr Gly Phe 
405 410 415 

Ala Leu Pro Pro Ser Asp Ile Ala Gln Thr Thr Leu Val Gly Ile Ile 
420 425 430 

Leu Ser Arg Gly Thr Ala Ala Ser Val Ser Phe Met Phe Ser Tyr Ile 
435 440 445 

Leu Leu Thr Met Cys Arg Asn Leu Ile Thr Phe Leu Arg Glu Thr Phe 
450 455 460 

Leu Asn Arg Tyr Val Pro Phe Asp Ala Ala Val Asp Phe His Arg Trp 
465 470 475 480 

Ile Ala Met Ala Ala Val Val Leu Ala Ile Leu His Ser Ala Gly His 
485 490 495 

Ala Val Asn Val Tyr Ile Phe Ser Val Ser Pro Leu Ser Leu Leu Ala 
500 505 510 

Cys Ile Phe Pro Asn Val Phe Val Asn Asp Gly Ser Lys Leu Pro Gln 
515 520 525 

Lys Phe Tyr Trp Trp Phe Phe Gln Thr Val Pro Gly Met Thr Gly Val 
530 535 540 

Leu Leu Leu Leu Val Leu Ala Ile Met Tyr Val Phe Ala Ser His His 
545 550 555 560 

Phe Arg Arg Arg Ser Phe Arg Gly Phe Trp Leu Thr His His Leu Tyr 
565 570 575 

Ile Leu Leu Tyr Ala Leu Leu Ile Ile His Gly Ser Tyr Ala Leu Ile 
580 585 590 

Gln Leu Pro Thr Phe His Ile Tyr Phe Leu Val Pro Ala Ile Ile Tyr 
595 600 605 

Gly Gly Asp Lys Leu Val Ser Leu Ser Arg Lys Lys Val Glu Ile Ser 
610 615 620 

Val Val Lys Ala Glu Leu Leu Pro Ser Gly Val Thr Tyr Leu Gln Phe 
625 630 635 640 

Gln Arg Pro Gln Gly Phe Glu Tyr Lys Ser Gly Gln Trp Val Arg Ile 
645 650 655 

Ala Cys Leu Ala Leu Gly Thr Thr Glu Tyr His Pro Phe Thr Leu Thr 
660 665 670 

Ser Ala Pro His Glu Asp Thr Leu Ser Leu His Ile Arg Ala Val Gly 
675 680 685 

Pro Trp Thr Thr Arg Leu Arg Glu Ile Tyr Ser Ser Pro Lys Gly Asn 
690 695 700 

Gly Cys Ala Gly Tyr Pro Lys Leu Tyr Leu Asp Gly Pro Phe Gly Glu 
705 710 715 720 

Gly His Gln Glu Trp His Lys Phe Glu Val Ser Val Leu Val Gly Gly 
725 730 735 

Gly Ile Gly Val Thr Pro Phe Ala Ser Ile Leu Lys Asp Leu Val Phe 
740 745 750 

Lys Ser Ser Leu Gly Ser Gln Met Leu Cys Lys Lys Ile Tyr Phe Ile 
755 760 765 

Trp Val Thr Arg Thr Gln Arg Gln Phe Glu Trp Leu Ala Asp Ile Ile 
770 775 780 

Gln Glu Val Glu Glu Asn Asp His Gln Asp Leu Val Ser Val His Ile 
785 790 795 800 

Tyr Val Thr Gln Leu Ala Glu Lys Phe Asp Leu Arg Thr Thr Met Leu 
805 810 815 

Tyr Ile Cys Glu Arg His Phe Gln Lys Val Leu Asn Arg Ser Leu Phe 
820 825 830 

Thr Gly Leu Arg Ser Ile Thr His Phe Gly Arg Pro Pro Phe Glu Pro 
835 840 845 

Phe Phe Asn Ser Leu Gln Glu Val His Pro Gln Val Arg Lys Ile Gly 
850 855 860 

Val Phe Ser Cys Gly Pro Pro Gly Met Thr Lys Asn Val Glu Lys Ala 
865 870 875 880 

Cys Gln Leu Val Asn Arg Gln Asp Arg Ala His Phe Met His His Tyr 
885 890 895 

Glu Asn Phe 



<210> 49 
<211> 26 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Primer 
<400> 49 

cctgacagat gtatttcact acccag 26 



<210> 50 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 50 

ggatcggagt cactcccttc gctg 24 



<210> 51 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 51 

ctagaagctc tccttgttgt aataga 26 



<210> 52 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 52 

atgaacacct ctggggtcag ctga 24 

<210> 53 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 53 

atgaacacct ctggggtcag ctga 24 



<210> 54 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 54 

gtcctctgca gcattgttcc tctta 25 



<210> 55 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 55 

cctgacagat gtatttcact acccag 26 



<210> 56 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 56 

ggatcggagt cactcccttc gctg 24 

<210> 57 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 57 

aatgacactg tactggaggc cacag 25 

<210> 58 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 58 

ctgccatcta ccacacggat ctgc 24 

<210> 59 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 59 

cttgccattc caaagcttcc atgc 24 

<210> 60 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 60 

gtacaagtca ggacagtggg tgcg 24 

<210> 61 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 61 

tggatgatgt cagccagcca ctca 24 
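
The primer entries (SEQ ID NOS: 49 to 61) can be cross-checked against their declared `<211>` lengths, and repeated sequences can be flagged, with a short script. The sketch below is illustrative only: the sequences and lengths are copied from the listing, while the names `PRIMERS`, `normalize`, `length_mismatches` and `duplicate_groups` are assumptions introduced here.

```python
from collections import defaultdict

# SEQ ID NO -> (sequence as printed in the listing, declared <211> length)
PRIMERS = {
    49: ("cctgacagat gtatttcact acccag", 26),
    50: ("ggatcggagt cactcccttc gctg", 24),
    51: ("ctagaagctc tccttgttgt aataga", 26),
    52: ("atgaacacct ctggggtcag ctga", 24),
    53: ("atgaacacct ctggggtcag ctga", 24),
    54: ("gtcctctgca gcattgttcc tctta", 25),
    55: ("cctgacagat gtatttcact acccag", 26),
    56: ("ggatcggagt cactcccttc gctg", 24),
    57: ("aatgacactg tactggaggc cacag", 25),
    58: ("ctgccatcta ccacacggat ctgc", 24),
    59: ("cttgccattc caaagcttcc atgc", 24),
    60: ("gtacaagtca ggacagtggg tgcg", 24),
    61: ("tggatgatgt cagccagcca ctca", 24),
}


def normalize(seq: str) -> str:
    """Drop the 10-base grouping spaces and lower-case the sequence."""
    return "".join(seq.split()).lower()


def length_mismatches(primers):
    """Return SEQ ID NOs whose actual length differs from the declared one."""
    return sorted(
        sid for sid, (seq, declared) in primers.items()
        if len(normalize(seq)) != declared
    )


def duplicate_groups(primers):
    """Return groups of SEQ ID NOs that share an identical sequence."""
    groups = defaultdict(list)
    for sid, (seq, _declared) in primers.items():
        groups[normalize(seq)].append(sid)
    return sorted(sorted(ids) for ids in groups.values() if len(ids) > 1)


print("length mismatches:", length_mismatches(PRIMERS))
print("identical entries:", duplicate_groups(PRIMERS))
```

In the listing as printed, SEQ ID NOS: 49 and 55, 50 and 56, and 52 and 53 carry the same sequence, which the second function reports as groups.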
effects of the compound/composition. 

This International Searching Authority found multiple (groups of) 
inventions in this international application, as follows: 

1. Claims: 1-16 all partially 

A protein capable of stimulating superoxide production, 
wherein the protein comprises a mox1, preferably with the 
sequence of SEQ ID NO 2, SEQ ID NO 21, SEQ ID NO 42, a 
fragment thereof or a conservative substitution thereof. 
A nucleotide sequence, preferably with the sequence of SEQ 
ID NO 1, SEQ ID NO 22 or SEQ ID NO 41 encoding for the above 
mentioned protein, fragment thereof or conservative 
substitution. 

A vector comprising said nucleotide sequence and a cell 
containing said vector. 

An antibody capable of binding the above mentioned protein, 
fragment or conservative substitution. A method of 
stimulating superoxide formation, in vitro or in vivo, 
comprising administration, in vitro or in vivo, of a 
composition comprising the above mentioned vector or the 
above mentioned protein or its fragment or its conservative 
substitution in a pharmaceutically acceptable vector. A 
method for determining the activity of a drug comprising 
measuring the activity of the above mentioned protein to 
stimulate superoxide formation following administration of 
the drug. 



2. Claims: 1-16 all partially 

As invention 1, but for a mox2 with SEQ ID NO: 4 and SEQ ID 
NO: 3 



3. Claims: 1-16 all partially 

As invention 1, but for a duox1 with SEQ ID NO: 46 and SEQ 
ID NO: 45 and for a duox2 with SEQ ID NO: 48 and SEQ ID NO: 
47 